COMBINATORIAL MODELS OF CREATION-ANNIHILATION



PAWEL BLASIAK AND PHILIPPE FLAJOLET 



Abstract. Quantum physics has revealed many interesting formal properties associated with the algebra of two operators, $A$ and $B$, satisfying the partial commutation relation $AB - BA = 1$. This study surveys the relationships between classical combinatorial structures and the reduction to normal form of operator polynomials in such an algebra. The connection is achieved through suitable labelled graphs, or "diagrams", that are composed of elementary "gates". In this way, many normal form evaluations can be systematically obtained, thanks to models that involve set partitions, permutations, increasing trees, as well as weighted lattice paths. Extensions to $q$-analogues, multivariate frameworks, and urn models are also briefly discussed.
1. Introduction

The theme of our study is extremely simple. It consists in investigating some of the formal consequences of the partial commutation relation between two operators $A$ and $B$ (belonging to some unspecified algebra of operators):

(1) $[A, B] = 1$.

Here $[U, V] := UV - VU$ is the Lie bracket and 1 represents the identity operator. The relation (1) will henceforth be called the creation-annihilation axiom. Indeed, in quantum physics¹, it is satisfied by the annihilation and creation operators, $a$ and $a^\dagger$, which are adjoint to each other and serve to decrease or increase the number of photons by 1: in this context, one should take $A = a$ and $B = a^\dagger$.

From an algebraic standpoint, one thus considers the polynomial ring $\mathbb{C}\langle A, B\rangle$ in non-commuting indeterminates $A$, $B$, and makes systematic use of the reduction of $AB - BA - 1$ to 0. (Technically, this corresponds to taking the quotient $\mathbb{C}\langle A, B\rangle / \mathcal{I}$ of the ring $\mathbb{C}\langle A, B\rangle$ of polynomials in non-commuting indeterminates by the two-sided ideal $\mathcal{I}$ generated by $AB - BA - 1$.) It is then possible to regard (1) as a directed rewrite rule

(2) $AB \longrightarrow 1 + BA$,

by which any non-commutative polynomial $\mathfrak{h}$ in indeterminates $A$, $B$ is completely reduced to a unique normal form, $\mathfrak{N}(\mathfrak{h})$, where, in each monomial, all the occurrences of $B$ precede all the occurrences of $A$.
For instance, the chain

(3)
$AB\underline{AB}A \longrightarrow AB(1 + BA)A = ABBAA + \underline{AB}A$
$\longrightarrow \underline{AB}BAA + A + BAA \longrightarrow B\underline{AB}AA + A + 2BAA$
$\longrightarrow BBAAA + 3BAA + A$

proves that $\mathfrak{N}(ABABA) = B^2A^3 + 3BA^2 + A$. (At each step, the pair $AB$ involved in a reduction has been underlined.) It is precisely this non-trivial rearrangement process, known in quantum physics as normal ordering, which we propose to examine.
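The rewrite rule (2) can be turned directly into a small program. The following is only a minimal sketch, assuming that words are represented as Python strings over the letters `A` and `B`; the function name `normal_order` is introduced here for illustration and does not come from the paper.

```python
from collections import Counter

def normal_order(word):
    """Reduce a word over {'A', 'B'} with the rule AB -> 1 + BA.

    Returns a Counter mapping (b, a) to the coefficient of B^b A^a
    in the normal form of the word.
    """
    result = Counter()
    stack = [(word, 1)]
    while stack:
        w, coeff = stack.pop()
        i = w.find("AB")
        if i < 0:
            # No factor AB is left, so w has the shape B...BA...A.
            result[(w.count("B"), w.count("A"))] += coeff
        else:
            stack.append((w[:i] + w[i + 2:], coeff))         # the "1" term
            stack.append((w[:i] + "BA" + w[i + 2:], coeff))  # the "BA" term
    return result

print(dict(normal_order("ABABA")))
```

On the word `ABABA`, the sketch reproduces the three terms of the chain (3). It applies the rule to the leftmost factor `AB` rather than in the order chosen in (3); since the normal form is unique, the order of the reductions does not affect the result.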

A particular realization of the commutation relation is obtained by choosing some sufficiently general space $\{f(x)\}$ of smooth functions (typically, the class $C^\infty(0,1)$ of infinitely differentiable functions over the interval $(0,1)$, or the space $\mathbb{C}[x]$ of polynomials in the indeterminate $x$), on which two operators, $X$ and $D$, are defined:

$(Xf)(x) = x f(x); \qquad (Df)(x) = \frac{d}{dx} f(x)$.



1 The present paper deals with one or a finite number of pairs of such operators (known as "modes"), a situation that is directly relevant to the "second quantization" of quantum theory, in particular, in quantum optics; see, for instance, Louisell's book [89]. The case of a continuum of modes, which lies at the basis of quantum field theory and is associated with Feynman diagrams [43, 44, 95, 132], is not touched upon here. The developments in our study require a modicum of combinatorial theory but no a priori knowledge of quantum physics.
Then the creation-annihilation axiom is obviously satisfied by $A = D$ and $B = X$: one recovers in this way the familiar Weyl relations $[D, X] = 1$ of abstract differential algebra [112, Ch. 1]. The interest of such a differential model of creation-annihilation is that it is "faithful", meaning that any identity that holds in it (without any additional assumption regarding the space of functions) is true in all generality under (1). This differential view will prove central to our combinatorial developments.
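As a quick sanity check of the differential model, one can let $X$ and $D$ act on polynomials and verify that $DX - XD$ is the identity. The sketch below assumes a polynomial is stored as a list of coefficients, with `p[k]` the coefficient of $x^k$; all function names are illustrative choices made here.

```python
from itertools import zip_longest

def X(p):
    """Multiply by x: p[k] is the coefficient of x^k."""
    return [0] + list(p)

def D(p):
    """Differentiate with respect to x."""
    return [k * p[k] for k in range(1, len(p))]

def commutator_DX(p):
    """Return (DX - XD) applied to p, as a coefficient list."""
    left, right = D(X(p)), X(D(p))
    return [a - b for a, b in zip_longest(left, right, fillvalue=0)]

def trim(p):
    """Drop trailing zero coefficients so that equal polynomials compare equal."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

f = [3, 0, -2, 5]   # 3 - 2x^2 + 5x^3
print(trim(commutator_DX(f)) == f)
```

The check only exercises the relation on one polynomial at a time; it illustrates the axiom rather than proving the faithfulness claim.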

We henceforth adopt the more suggestive differential terminology, $A \mapsto D$ and $B \mapsto X$. We shall concern ourselves here with the reduction to normal form of a variety of expressions such as

(4) $(XD)^n, \quad (X^2D)^n, \quad (D + X)^n, \quad (X^2D^2)^n, \quad (D^2 + X^2)^n,$

and many more. Observe that, by taking an exponential generating function, the collection of all the reductions associated with a family $(\mathfrak{h}^n)_{n \ge 0}$ is summarized by the reduction of $e^{z\mathfrak{h}}$.

Our starting point, to be developed in Section 2, is a combinatorial representation of any normal-form monomial $\mathfrak{m} = X^r D^s$ by a basic graph, called a "gate". The reduction of a polynomial in operators $D$, $X$ (for instance, any of (4)) can then be regarded combinatorially as the process of building a whole collection of labelled diagrams, which are those special graphs assembled from a fixed collection of elementary gates (the ones arising from the monomials in (4)). From here, as we shall see repeatedly, obtaining the coefficients in the normal form of an operator expression is equivalent to enumerating complex graphs built from a fixed collection of gates. In other words, we are going to explore the following Equivalence Principle (see Theorem 1 below for a precise formulation):

Normal ordering (operator algebra) $\Longleftrightarrow$ Diagram enumeration (combinatorics)

In so doing, we build on classical works in combinatorial analysis relative to the combinatorics of differential operators; see, for instance, Joyal's theory of species [74], its exposition in the book by Bergeron, Labelle and Leroux [7], or the recent book by Flajolet and Sedgewick [51], whose notations we adopt.

The idea of using graphs to model creation-annihilation is not new. The most celebrated originator of the representation of physical processes by labelled graphs is R.P. Feynman, who used it as a convenient book-keeping tool for perturbative expansions of quantum electrodynamics [43], and the whole idea is neatly articulated by Baez and Dolan in [3, pp. 46-49]. Seen under the interesting perspective of [3], what we propose to do is "decategorize" the abstract creation-annihilation theory by providing concrete combinatorial models of this theory. Our standpoint is however different in that we place a strong emphasis on connections with classical combinatorial structures and aim at developing exactly solvable models, hence explicit formulae. With this in mind, we tread in the steps of earlier works of Blasiak, Duchamp, Horzela, Mendez, Penson, and Solomon, for which the present article can serve inter alia as a synthetic and systematic review.

Our purpose is to shed light on calculations developed within quantum physics 
and do so by means of adapted combinatorial models. For instance, set partitions 
(with their associated Bell and Stirling numbers) naturally arise from diagrams 
corresponding to powers of XD, a fact largely known in combinatorics and finite 
Operator            Structure            Generating function type                         Section

$(X + D)^n$         involutions          $e^{z^2/2}$                                      3
$(XD)^n$            set partitions       $e^{e^z - 1}$                                    4
$(X^2 + D^2)^n$     alternating cycles   $1/\sqrt{\cos(2z)}$                              5.1
$(X^r D)^n$         increasing trees     $\exp\big((1 - (r-1)z)^{-1/(r-1)} - 1\big)$      6
$(X^2 D)^n$         permutations         $\exp\big(\frac{z}{1 - z}\big)$                  6.1

FIGURE 1. Some normal ordering problems relative to $\mathfrak{h}^n$, the corresponding combinatorial structures, a representative counting generating function, and the relevant section in this article.



difference calculus, which is examined, under a quantum field angle, by Bender, Brody, and Meister in [5], and is further treated by Mendez, Blasiak, and Penson in [99]. Also, by providing a complete permutation model, we explain and extend calculations done by Mehta that are relative to the normal ordering of

$e^{Q(X,D)}$, where $Q(X, D) = \alpha D^2 + \beta X^2 + \gamma XD$.

Finally, the combinatorial approach we advocate provides, in a number of cases, a transparent alternative to the Lie algebra approach exemplified by Wilcox [133]. Figure 1 lists a representative set of operators discussed in this paper, together with the underlying combinatorial structures and the corresponding generating function types, as summarized in each case by a prototypical instance.

Plan of the paper. Section 2 presents the basic combinatorial model of gates and diagrams, by which normal ordering is reduced to counting problems. It also contains a brief reminder of the symbolic approach to combinatorial enumeration via generating functions (§2.4). We then consider the normal ordering problem relative to $\mathfrak{h}^n$ in increasing order of complexity of the polynomial $\mathfrak{h}$. Section 3 concerns the simplest case of linear forms in $X$, $D$, for which the natural model is seen to be that of (coloured) involutions. In Section 4, we proceed with the special quadratic form $(XD)$, which is tightly coupled with set partitions. The general quadratic form in $X$, $D$ treated in Section 5 leads to "zigzags", which are structures related to alternating permutations, and to local order patterns in (unconstrained) permutations. Section 6 shows that semilinear operators of the form $(\phi(X)D)$ are in general modeled by simple varieties of trees and forests. Section 7 momentarily departs from our main thread and introduces lattice path models: it presents a direct treatment of binomial forms, including the interesting "Fermat case" $(X^r + D^r)$, which is conducive to continued fraction representations. Section 8 discusses two frameworks that are closely related to diagrams: one is the rook placement model, for which a general scanning algorithm schema provides an alternative to the methods of Section 7; the other is the planar embedding of diagrams, which leads to a systematic approach to many $q$-analogues of combinatorial theory. We conclude in Section 9 with a brief discussion of multivariate extensions of the gates-and-diagram model.
The present paper contains few new results. Rather, it is an attempt at a
synthetic presentation of the combinatorial approach to normal ordering problems, 
which has been the subject of an extensive literature in recent decades. What 
we found striking, when preparing this paper, is the fact that most (perhaps all?) 
explicitly known expansions in this orbit of questions can be put in correspondence 
with classical or semi-classical combinatorial models. As we hope to demonstrate, 
one of the interests of the combinatorial approach in this range of problems is to 
bring clarity into what should be explicit and what is not likely to be. 



2. Diagrams, normal ordering, and enumeration

In this section, we associate with each polynomial in the operators $X$, $D$ a finite collection of elementary "gates" that are graphs with just one inner vertex, as in (5) below. We then define "diagrams" that are complex graphs built by assembling elementary gates according to certain rules. (Our terminology is inspired by that of digital circuits composed of elementary boolean gates.) A fundamental and easy theorem (Theorem 1) then relates the normal ordering of the powers $\mathfrak{h}^n$ of a polynomial $\mathfrak{h}$ to the enumeration of labelled diagrams of size $n$ that can be built out of the gates associated with $\mathfrak{h}$.

2.1. Gates, diagrams and labelling. We make use of the standard definitions of graph theory, as found, for instance, in [33]. We however need to extend them somewhat. Technically, our diagrams are directed multigraphs (edges are directed, multiple edges between two vertices are allowed) that are further enriched by placing an ordering on the edges stemming from a vertex and similarly for edges leading into a vertex. (Put otherwise, we are considering graphs given together with an embedding into the plane.) We recall that the indegree and outdegree of a vertex are, respectively, the number of incoming and outgoing edges of that vertex. In what follows, we freely make use of graphical representations to illustrate our basic notions, while avoiding long formal developments; see Figure 2.

Definition 1. A gate of type $(r, s)$ consists of a vertex, called the inner node, to which is attached an ordered collection of $r$ outgoing edges and an ordered collection of $s$ ingoing edges. The vertices, other than the inner node, that have indegree 0 are called the inputs; the vertices, other than the inner node, that have outdegree 0 are called the outputs.

A gate thus comprises $r + s + 1$ nodes in total. The inner node is conventionally coloured in black (•), whilst the input and output nodes are coloured in grey and white, respectively. For instance:

(5) gate of type $(r, s)$: [picture: one inner node (•) with $r$ outputs above and $s$ inputs below]

In the graphical representation, edges are systematically (and implicitly) oriented from bottom to top; they are naturally ordered among themselves, conventionally, from left to right.
FIGURE 2. Left: a collection $\mathcal{H}$ of two gates of type (2,2) and (2,1). Right: a labelled diagram $\delta$ on the basis $\mathcal{H}$ that comprises three components.



Definition 2. A diagram is a directed multigraph, which is acyclic (i.e., has no directed cycle) and is such that, for each vertex, an ordering has been fixed on its incoming and on its outgoing edges. In addition, there is a designated set of inputs (vertices with indegree 0 and outdegree 1) and a designated set of outputs (vertices with indegree 1 and outdegree 0). The vertices that are neither inputs nor outputs are called inner nodes. The size of a diagram is the number of inner nodes² it comprises.

The colouring convention of gates, as in (5), is extended to the representation of diagrams; see Figure 2 for an instance. Note that a diagram is not necessarily connected: it may be comprised of several (weakly connected) components. Clearly, a diagram specifies the collection of gates it is composed of.

Definition 3. Given a set $\mathcal{H}$ of gates, assimilated to a subset of $\mathbb{Z}_{\ge 0} \times \mathbb{Z}_{\ge 0}$, a diagram $\delta$ is said to have $\mathcal{H}$ as a basis if, for each outdegree-indegree pair $(r, s)$ of an inner node of $\delta$, one has $(r, s) \in \mathcal{H}$.

Finally, we need to introduce a crucial notion of labelling, which is consistent with the standard one of combinatorial analysis [7, 51, 122, 134], to which an additional monotonicity constraint is adjoined.

Definition 4. A labelled diagram is comprised of an unlabelled diagram in the sense of Definition 2 together with an assignment of integer labels to inner nodes in such a way that: (i) all labels are distinct and they form an initial segment of $\mathbb{Z}_{\ge 1}$; (ii) labels increase along any directed path.

Note that a labelled diagram can be interpreted as representing the complete history of a particular incremental construction of an unlabelled diagram by successively "grafting" gates one after the other (Figure 3), following the order given by the labels; this viewpoint will prove useful in Subsection 2.3. The enumeration of labelled diagrams corresponding to a fixed basis $\mathcal{H}$ of gates is a central aspect of our approach to the creation-annihilation algebra³.

FIGURE 3. A particular grafting of a gate $\delta = X^r D^s$ (here, $r = 3$, $s = 2$) on a diagram $\gamma$ corresponding to a term $X^a D^b$ (with $a = 3$, $b = 4$), in the case where two edges are matched ($t = 2$).

2 In what follows, we shall discount from size the outer nodes, that is, the inputs and outputs (which have size 0), and then often use the terms "vertex" and "inner node" (or simply "node") interchangeably.

2.2. The equivalence principle. We propose to state a general form of the Equivalence Principle (Theorem 1 below), in which gates and diagrams can be weighted. Fix a field $\mathbb{K}$, called the domain of weights; it may be the field $\mathbb{C}$ of complex numbers, in the case of numeric weights, or a field $\mathbb{C}(x, y, \ldots)$ of rational functions in formal variables $x, y, \ldots$, in the case of symbolic weights. A weighting of a collection $\mathcal{H}$ of gates is an assignment of a weight $w_{r,s} \in \mathbb{K}$ to each gate of $\mathcal{H}$; equivalently, a function $w : \mathcal{H} \to \mathbb{K}$. Such a weighting on gates induces a weighting on diagrams: the weight $w(\delta)$ of a diagram $\delta$ is defined to be the product of the weights of the individual gates that the diagram is comprised of. Finally, the total weight⁴ of a class $\mathcal{D}$ of diagrams is $\sum_{\delta \in \mathcal{D}} w(\delta)$. With this convention, we state:

Theorem 1 (Equivalence Principle). Consider a polynomial $\mathfrak{h}$ with normal form

(6) $\mathfrak{h} := \sum_{(r,s) \in \mathcal{H}} w_{r,s}\, X^r D^s$.

Then the normal ordering of the power $\mathfrak{h}^n$,

(7) $\mathfrak{N}(\mathfrak{h}^n) = \sum_{a,b} c_{n,a,b}\, X^a D^b$,

is such that the coefficient $c_{n,a,b}$ coincides with the total weight of (monotonically) labelled diagrams that admit $\mathcal{H}$ as a basis weighted by $w$, have size $n$, and are comprised of $a$ outputs and $b$ inputs.



3 As pointed out to us by Ch. Brouder and G. Duchamp (October 2010), an early ancestor of the diagrams presented here is to be found in the 1965 book of Friedrichs [57] relative to perturbation of spectra in Hilbert space; see, especially, pages 55-56. We note also that the diagrams can be cast into the framework of [99] by a bijective correspondence "gates" ↔ "bugs" and "diagrams" ↔ "colonies".

4 Thus "total weight" extends the notion of "cardinality" or "number of elements" and determining total weights can be regarded as a (generalized) enumeration problem.
monomial $X^r D^s$                  $\leftrightsquigarrow$   gate of type $(r, s)$
polynomial $\mathfrak{h}$ in $X$, $D$   $\leftrightsquigarrow$   weighted basis $\mathcal{H}$ of gates
$\mathfrak{h}^n$                    $\leftrightsquigarrow$   labelled diagrams of size $n$ on $\mathcal{H}$
$e^{z\mathfrak{h}}$                 $\leftrightsquigarrow$   all diagrams (exp. generating function)
$(z^n X^a D^b)$                     $\leftrightsquigarrow$   (size = $n$, #outputs = $a$, #inputs = $b$).

FIGURE 4. The correspondences between normal forms of algebraic expressions in $X$, $D$, $z$ (left) and combinatorial diagrams (right).



The proof is deferred until the next subsection. The interest of Theorem 1
lies in the fact that it transforms an algebraic normal-ordering problem into a 
combinatorial enumeration problem, one that may be studied by the methods of 
combinatorial analysis. It is precisely our aim to explore aspects of this equivalence 
and the derivation of explicit normal forms, based on combinatorial theory. 

We note that Theorem 1 can be rephrased in terms of generating functions. First, by (7), the quantity $\mathfrak{N}(\mathfrak{h}^n)$ is the generating polynomial of labelled diagrams with $X$ marking outputs, and $D$ marking inputs: as in the case of gates, a diagram with $a$ outputs and $b$ inputs corresponds to a term $X^a D^b$ in the normal form. Introduce next the operator exponential

$e^{z\mathfrak{h}} := \sum_{n \ge 0} \mathfrak{h}^n \frac{z^n}{n!}$,

which is a formal power series in $z$ with coefficients in the polynomial ring $\mathbb{C}\langle X, D\rangle$, and the corresponding normal form

$\mathfrak{N}(e^{z\mathfrak{h}}) = \sum_{n \ge 0} \mathfrak{N}(\mathfrak{h}^n) \frac{z^n}{n!}$.

Then, equation (7) expresses the fundamental identity

(8) $\mathfrak{N}(e^{z\mathfrak{h}}) = \Big( \sum_{n,a,b} c_{n,a,b}\, u^a v^b \frac{z^n}{n!} \Big)_{u \to X,\ v \to D}$.

The sum on the right-hand side is nothing but the exponential generating function (EGF) of the total weight (number) of diagrams built on the weighted basis $\mathcal{H}$, where $z$ marks size, $u$ marks the number of outputs, and $v$ marks the number of inputs. In writing such equalities, it is understood that all occurrences of $X$ should be systematically written⁵ to the left of all occurrences of $D$.

Theorem 1 thus expresses a general correspondence between the world of operators and the realm of combinatorics, which is summarized in Figure 4.



5 Thus, one calculates generating functions of diagrams in the usual way, then, at the end, one should group all occurrences of $u$ before those of $v$, and finally perform the substitution $u \to X$, $v \to D$. If $G(z; u, v)$ is the generating function of diagrams, a notation often used by physicists to indicate this process is

$: G(z; X, D) :$

which plainly represents a commutative image, with all $X$s conventionally preceding all $D$s, then interpreted back as an operator in non-commuting $X$, $D$.
2.3. Proof of the Equivalence Principle (Theorem 1). For the proof of Theorem 1, a basic observation is the identity

(9) $(X^r D^s)(X^a D^b) = \sum_{t \ge 0} \binom{s}{t} \binom{a}{t}\, t!\, X^{r+a-t} D^{s+b-t}$.

Without loss of generality, it suffices to consider the case $r = b = 0$. Then, with the interpretation $Df(x) = \frac{d}{dx} f(x)$ and $Xf(x) = x f(x)$, we find, by Leibniz's rule,

$(D^s X^a) f(x) = D^s (x^a f(x)) = \sum_{t=0}^{s} \binom{s}{t} (D^t x^a)(D^{s-t} f(x)) = \sum_{t=0}^{s} \binom{s}{t} \binom{a}{t}\, t!\, x^{a-t} (D^{s-t} f(x))$.

The last line implies (9), upon multiplying on the left by $X^r$ and on the right by $D^b$.
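Identity (9) can be spot-checked numerically in the differential model by letting both sides act on a test polynomial. The sketch below again assumes polynomials stored as coefficient lists (`p[k]` is the coefficient of $x^k$); the helper names are hypothetical and chosen here for illustration.

```python
from math import comb, factorial

def apply_monomial(r, s, p):
    """Apply X^r D^s to a polynomial p, where p[k] is the coefficient of x^k."""
    p = list(p)
    for _ in range(s):                       # D^s acts first
        p = [k * p[k] for k in range(1, len(p))]
    return [0] * r + p                       # then X^r shifts the coefficients

def trim(p):
    """Drop trailing zero coefficients."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def left_side(r, s, a, b, p):
    """(X^r D^s)(X^a D^b) applied to p, computed operator by operator."""
    return trim(apply_monomial(r, s, apply_monomial(a, b, p)))

def right_side(r, s, a, b, p):
    """The sum over t in identity (9), applied to p."""
    total = [0] * (len(p) + r + a)
    for t in range(min(s, a) + 1):
        mult = comb(s, t) * comb(a, t) * factorial(t)
        for k, v in enumerate(apply_monomial(r + a - t, s + b - t, p)):
            total[k] += mult * v
    return trim(total)

p = [1, 2, 3, 4, 5, 6, 7]
print(left_side(3, 2, 3, 4, p) == right_side(3, 2, 3, 4, p))
```

The sum over $t$ is cut at $\min(s, a)$, since the binomial coefficients vanish beyond that point.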

Let now $X^a D^b$ be a monomial that figures explicitly in $\mathfrak{N}(\mathfrak{h}^n)$ and let $X^r D^s$ be a monomial of $\mathfrak{h}$. The reduction to normal form of $\mathfrak{h}^{n+1}$ (viewed as $\mathfrak{h}^{n+1} = \mathfrak{h} \cdot \mathfrak{h}^n$) is obtained by applying the rule (9), then summing over all values $(r, s) \in \mathcal{H}$. Assume now, by induction, that the identity (7) of Theorem 1 holds for a certain value of $n$, so that the coefficients of $\mathfrak{N}(\mathfrak{h}^n)$ count (with suitable weights) all diagrams of size $n$. The collection of legal diagrams of size $n + 1$ is obtained by grafting in all possible ways a gate of the basis $\mathcal{H}$ on a diagram of size $n$. If this diagram $\delta$ has $a$ outputs and $b$ inputs and the gate $\gamma$ is of type $(r, s)$, then the number of ways that such a grafting can be effected, when $t$ outputs of $\delta$ are plugged into $t$ inputs of $\gamma$, is exactly

(10) $\binom{s}{t} \binom{a}{t}\, t!$.

(The binomials count the choices of the $t$ inputs of $\gamma$ and the choices of the $t$ outputs of $\delta$ that are matched; the factorial counts the possibilities of attachment.) Figure 3 displays a particular attachment of a gate of type (3, 2) to a connected diagram with 3 outputs and 4 inputs.

It is then observed, from a comparison of (9) and (10), that the multiplicities induced either by a reduction of a left multiplication by $X^r D^s$ or by adding a gate of type $(r, s)$ are the same. Thus, the property that $\mathfrak{N}(\mathfrak{h}^{n+1})$ is the generating function of diagrams of size $n + 1$ with $X$ marking outputs and $D$ marking inputs is established. The property trivially holds for $\mathfrak{N}(\mathfrak{h})$, ensuring the basis of the induction. The proof of Theorem 1 is now complete.
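The induction used in the proof is also an algorithm: the normal form of $\mathfrak{h}^{n+1}$ follows from that of $\mathfrak{h}^n$ by left multiplication with each monomial of $\mathfrak{h}$, using (9). The following is a minimal sketch under the assumption that $\mathfrak{h}$ is given as a dictionary `{(r, s): weight}`; the names `mul_monomials` and `normal_power` are illustrative and not taken from the paper.

```python
from collections import defaultdict
from math import comb, factorial

def mul_monomials(r, s, a, b):
    """Normal form of (X^r D^s)(X^a D^b) by identity (9), as {(a', b'): coeff}."""
    return {(r + a - t, s + b - t): comb(s, t) * comb(a, t) * factorial(t)
            for t in range(min(s, a) + 1)}

def normal_power(h, n):
    """Coefficients c_{n,a,b} of the normal form of h^n, as {(a, b): coeff}.

    h is a dict {(r, s): w} standing for the sum of w * X^r D^s.
    """
    result = {(0, 0): 1}                      # h^0 is the identity operator
    for _ in range(n):
        nxt = defaultdict(int)
        for (r, s), w in h.items():           # graft one more gate of type (r, s)
            for (a, b), c in result.items():
                for key, m in mul_monomials(r, s, a, b).items():
                    nxt[key] += w * c * m
        result = dict(nxt)
    return result

print(normal_power({(1, 1): 1}, 3))   # (XD)^3
```

For $\mathfrak{h} = XD$ the coefficients obtained this way are the Stirling numbers of the second kind, and their sum for fixed $n$ is a Bell number, in agreement with the set partition model announced in the introduction.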

Note 1. The combinatorics of derivatives and Wick's Theorem. The usual rule of calculus $Dx^n = nx^{n-1}$ has combinatorial content. For instance, $D(x^4) = 4x^3$ can be obtained as

$D(xxxx) = \cancel{x}xxx + x\cancel{x}xx + xx\cancel{x}x + xxx\cancel{x}$,

corresponding to the formal rule: select in all possible ways an occurrence of the variable ($x$) and replace it by a neutral element (1). This lifts to arbitrary classes of combinatorial objects as a general "pointing-erasing" operation [51, p. 201], which applies to "atoms" (basic components of size 1, such as letters of words or vertices of graphs) that compose combinatorial structures.

Equipped with this combinatorial interpretation of $D$, we can proceed to revisit the normal form problem. As before, $X$ is the operator of multiplication by the variable $x$ (i.e., $(Xf)(x) = xf(x)$, for an arbitrary function $f(x)$). Combinatorially, if $f$ is regarded as representing an arbitrary combinatorial class whose elements are made of $x$-atoms, then $Xf$ means adjoining (or "creating") one new atom for each element of $f$. Consider now the normal form of an expression such as $DX^2$. It can be obtained by working out what $DX^2 f$ is: first $X^2 f$ adds two atoms; next, when $D$ is applied, it must either pick up and erase one of these added $x$-atoms or hit and destroy an atom of $f$. In summary, the external $D$ either "annihilates" one occurrence of a following $X$ or it gets directly applied to $f$ itself. Pictorially,

$DX^2 f = \cancel{D}\cancel{X}Xf + \cancel{D}X\cancel{X}f + XX(Df) = 2Xf + X^2 Df$,

so that

$\mathfrak{N}(DX^2) = 2X + X^2 D$.

Similar developments apply to arbitrary monomials in $D$ and $X$, giving rise to what is known in quantum physics as Wick's Theorem and is usually expressed in terms of the annihilation $a$ and creation $a^\dagger$ operators. (Here, we can interpret $a$ as $D$ and $a^\dagger$ as $X$.)

Proposition 1 (Wick's Theorem). The normal form of a monomial in X,D equals the 
sum of all expressions obtained by removing in all possible ways an arbitrary number of 
pairs D . . .X of annihilation (D) and creation (X) operators, where D precedes X, then 
reorganizing the resulting monomials in such a way that all Xs precede all Ds. 

Such a removal is called a contraction. An example of a normal form computation in this framework (with $: E :$ representing the commutative reorganization where all $X$s precede all $D$s, as in footnote 5) is as follows:

$DXDDXD = \; : DXDDXD :$   (no pair removed)
$\quad + : \cancel{D}\cancel{X}DDXD + \cancel{D}XDD\cancel{X}D + DX\cancel{D}D\cancel{X}D + DXD\cancel{D}\cancel{X}D :$   (1 pair removed)
$\quad + : \cancel{D}\cancel{X}\cancel{D}D\cancel{X}D + \cancel{D}\cancel{X}D\cancel{D}\cancel{X}D :$   (2 pairs removed)
$\quad = X^2 D^4 + 4XD^3 + 2D^2$.

(From here, the proof of Wick's Theorem is immediate: it suffices to proceed by induction on the length of the monomial to be reduced, distinguishing the two cases $\mathfrak{N}(Xw)$ and $\mathfrak{N}(Dw)$, where $w$ is an arbitrary monomial in $X$, $D$.)
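The enumeration of contractions described by Wick's Theorem can be carried out by brute force on short words. The sketch below assumes words given as strings over `X` and `D`, and simply tries every set of pairwise disjoint pairs ($D$ to the left of $X$); the function name `wick_normal_form` is an illustrative choice, and the exhaustive search is exponential, as remarked in the text.

```python
from collections import Counter
from itertools import combinations

def wick_normal_form(word):
    """Normal form of a word over {'X', 'D'} by summing over all contractions.

    Returns {(a, b): coeff} for the terms coeff * X^a D^b.
    """
    # Every admissible contraction pairs a D with an X standing to its right.
    pairs = [(i, j) for i, c in enumerate(word) if c == "D"
             for j in range(i + 1, len(word)) if word[j] == "X"]
    nx, nd = word.count("X"), word.count("D")
    result = Counter()
    for k in range(min(nx, nd) + 1):
        for chosen in combinations(pairs, k):
            used = [pos for pair in chosen for pos in pair]
            if len(set(used)) == 2 * k:       # contractions must be disjoint
                result[(nx - k, nd - k)] += 1
    return dict(result)

print(wick_normal_form("DXDDXD"))
```

For the word `DXDDXD`, the sketch finds one term with no contraction, four with one contraction and two with two contractions, matching the worked example above.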

The procedure that underlies Wick's Theorem may involve a large number of steps, as 
it amounts to enumerating all possible sets of contractions. The computational complexity 
is thus exponential in the worst case. By contrast, the combinatorial approach based on 
diagrams provides a graphical means to track down patterns in the diversity of Wick's 
contractions, which can then be effectively used to achieve a reduction in computational 
complexity and, in several cases, lead to closed-form solutions. 

Next, we can interpret gates in the same vein: $D^s$ corresponds to selecting a sequence of $s$ distinct atoms and replacing each of them by the neutral element, whereas $X^r$ means adding $r$ new atoms. A derivative ($D$) then "hooks" on atoms ($X$) or it "jumps" to the right. A particular labelled diagram composed of gates $(\gamma_1, \ldots, \gamma_n)$, with $\gamma_j$ associated with $X^{r_j} D^{s_j}$, then corresponds to a particular expansion of

$(X^{r_n} D^{s_n}) \cdots (X^{r_2} D^{s_2})(X^{r_1} D^{s_1})$.

Identities such as (9) then receive a natural interpretation and the construction of diagrams can be entirely developed in this way from first principles; this approach will be revisited in Section 8, when we discuss the $q$-difference operator $\Delta$. ■
FIGURE 5. A diagram (left) and its dual (right).



Note 2. Duality. There is an easy but important duality in reductions to normal forms. For noncommutative expressions in the two variables $X$, $D$, define an antimorphism $\varphi$ by the rules

(11) $\varphi(X) = D, \qquad \varphi(D) = X, \qquad \varphi(U \cdot V) = \varphi(V) \cdot \varphi(U)$,

together with an extension to polynomials by linearity. Thus, for a monomial $\mathfrak{m}$, its image $\varphi(\mathfrak{m})$ is obtained by exchanging the roles of $X$ and $D$ as well as reading letters backwards. It is then observed that the basic quantity $DX - XD - 1$ is invariant under $\varphi$. As a consequence, any identity $\mathfrak{U} = \mathfrak{V}$ (modulo $DX - XD - 1$) over terms $\mathfrak{U}$, $\mathfrak{V}$ in $X$, $D$ immediately implies a dual identity $\varphi(\mathfrak{U}) = \varphi(\mathfrak{V})$.

This duality has a natural graph interpretation. Define the dual graph $\overline{\Gamma}$ by reverting the arrows in $\Gamma$, relabeling the vertices • in reverse order (i.e., $1, 2, \ldots, n-1, n$ are changed into $n, n-1, \ldots, 2, 1$ respectively). The result is again a legitimate graph. See Figure 5 for illustration. Clearly, the dual graph is made of a basis of gates that is dual to the one of the original graph.

A consequence of the foregoing considerations is that the normal forms of $\mathfrak{h}^n$ and of $\varphi(\mathfrak{h})^n$ involve the same coefficients, with the coefficient of $X^a D^b$ in $\mathfrak{N}(\mathfrak{h}^n)$ being identical to that of $X^b D^a$ in $\mathfrak{N}(\varphi(\mathfrak{h})^n)$. In this way, for instance, the normal forms we develop below for $(X^2 D)^n$ immediately translate to the case of $(XD^2)^n$. (In physical contexts, duality is often associated with hermitian conjugacy; see Mikhailov's article [101, §3] for several examples.) ■
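The duality between $(X^2 D)^n$ and $(XD^2)^n$ is easy to test on small cases. The sketch below is an illustration only: it assumes words written as strings over `X` and `D`, reduces them with the rewrite rule $DX \to XD + 1$, and uses hypothetical helper names (`normal_form`, `dual`) introduced here.

```python
from collections import Counter

def normal_form(word):
    """Normal form of a word over {'X', 'D'} via DX -> XD + 1.

    Returns {(a, b): coeff} for the terms coeff * X^a D^b.
    """
    out, stack = Counter(), [(word, 1)]
    while stack:
        w, c = stack.pop()
        i = w.find("DX")
        if i < 0:
            out[(w.count("X"), w.count("D"))] += c
        else:
            stack.append((w[:i] + "XD" + w[i + 2:], c))  # the "XD" term
            stack.append((w[:i] + w[i + 2:], c))         # the "1" term
    return dict(out)

def dual(word):
    """Antimorphism of (11): swap X and D and read the word backwards."""
    return word[::-1].translate(str.maketrans("XD", "DX"))

n = 3
direct = normal_form("XXD" * n)            # (X^2 D)^n
mirrored = normal_form(dual("XXD" * n))    # (X D^2)^n
print(mirrored == {(b, a): c for (a, b), c in direct.items()})
```

Here `dual("XXD")` is `"XDD"`, so the second computation is indeed that of $(XD^2)^n$, and the comparison swaps the exponents of $X$ and $D$ as stated in the note.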

Note 3. The combinatorics of Taylor's formula. Here is a basic illustration of the combinatorics of derivatives. It is well known, from several areas of analysis and the calculus of finite differences⁶, that the exponential of a derivative plays the role of a translation operator (also known as "shift"). Specifically, with the notations of Note 1, we consider the operator

(12) $T_y := e^{yD}$.

Symbolically, we have, corresponding to Taylor's formula:

(13) $T_y \cdot f(x) = \sum_{n \ge 0} \frac{y^n}{n!} D^n f(x) = \sum_{n \ge 0} \frac{y^n}{n!} f^{(n)}(x) = f(x + y)$.

6 See, e.g., the treatises of Jordan [70, pp. 13-14] and Milne-Thomson [102, p. 33]. The interest of this symbolic view is inter alia to lead to an immediate proof of (a form of) the Euler-Maclaurin summation formula by a computation of $(e^{yD} - 1)^{-1}$; cf. [102, p. 260].
It is piquant to note that the formula admits a transparent combinatorial interpretation, in view of Note 1. Think of $f(x)$ as being the generating function of a class $\mathcal{F}$ of combinatorial objects, themselves composed of atoms represented by $x$.

The application of the operator $\frac{1}{n!} D^n$ means: "select in all possible ways an unordered collection of $n$ atoms in each element of $\mathcal{F}$ and replace these by neutral atoms". The application of $\frac{1}{n!} y^n D^n$ then translates as follows: "select in all possible ways a collection of $n$ atoms of type $x$ in each element of $\mathcal{F}$ and replace these by $y$-atoms". The exponential $e^{yD}$ then corresponds to choosing an arbitrary number of $x$'s and replacing them by $y$'s, the outcome being exactly the bivariate generating function $f(x + y)$. Thus, seen from combinatorics, Taylor's formula (13) simply expresses the decomposition of a bicolouring process according to the number of atoms whose colour is changed. Figuratively:

$\mathrm{Bicolour}_{x,y}[\mathcal{F}] = \sum_{n=0}^{\infty}$ "change $n$ occurrences of $x$ into $y$ in elements of $\mathcal{F}(x)$".

(This exercise appears, for instance, explicitly as Note III.31 in [51, p. 201].) ■
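For polynomials, the sum in (13) is finite, so the shift property of $e^{yD}$ can be checked exactly. The sketch below assumes coefficient lists (`p[k]` is the coefficient of $x^k$) and exact rational arithmetic; the function names are illustrative. The direct computation uses the binomial theorem, whose coefficient $\binom{m}{k}$ counts the ways of choosing which atoms of $x^m$ keep their colour.

```python
from fractions import Fraction
from math import comb, factorial

def derivative(p):
    """Differentiate a polynomial given by its coefficient list."""
    return [k * p[k] for k in range(1, len(p))]

def shift_by_exponential(p, y):
    """Apply e^{yD} = sum_n y^n D^n / n! to p; the sum is finite for polynomials."""
    total = [Fraction(0)] * len(p)
    q, n = list(p), 0
    while q:                                  # q holds D^n p
        for k, v in enumerate(q):
            total[k] += Fraction(y) ** n * v / factorial(n)
        q, n = derivative(q), n + 1
    return total

def shift_directly(p, y):
    """Coefficients of f(x + y), expanded with the binomial theorem."""
    total = [Fraction(0)] * len(p)
    for m, c in enumerate(p):
        for k in range(m + 1):
            total[k] += c * comb(m, k) * Fraction(y) ** (m - k)
    return total

f = [1, 0, 2, -3]   # 1 + 2x^2 - 3x^3
print(shift_by_exponential(f, 2) == shift_directly(f, 2))
```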



Note 4. Normal forms and PDEs. The reduction of powers of differential operators to normal form is of interest in the analysis of certain types of partial differential equations (PDEs). Consider the initial value problem, also known as "Cauchy problem",

(14) $\frac{\partial}{\partial t} F(x, t) = \Gamma F(x, t)$, with $\Gamma \in \mathbb{C}[x, \partial]$, $\partial \equiv \partial_x = \frac{\partial}{\partial x}$; $\qquad F(x, 0) = f(x)$,

where $\Gamma$ is a differential operator, which is a polynomial in $x$ and $\partial_x$. The solution is determined by the initial data $f(x)$ at time $t = 0$. An equation of this sort is sometimes referred to as an evolution equation, with the specific choice of the linear differential operator $\Gamma$ serving to model various physical contexts. Classical examples, in the one-dimensional case, are the heat equation,

(15) $\frac{\partial F}{\partial t} = \frac{\partial^2 F}{\partial x^2}$,

and the Schrödinger equation with time-independent potential $V = V(x)$:

(16) $i\hbar \frac{\partial F}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 F}{\partial x^2} + V(x) F$.

With the notations of (14), introduce the Ansatz

(17) $F = e^{t\Gamma} f$,

where the operator exponential is classically defined as

(18) $e^{t\Gamma} := \sum_{n \ge 0} \frac{t^n}{n!} \Gamma^n$.

Proceeding formally, we verify that

(19) $\frac{\partial}{\partial t} F = \Big( \frac{\partial}{\partial t} e^{t\Gamma} \Big) f = \Gamma e^{t\Gamma} f = \Gamma F$,
while, by construction, $F$ reduces to $f$ at $t = 0$. (Analytically, sound uses of such operator exponentials can be based on the theory of operator semigroups; see, for instance, the book by Engel and Nagel [41], and especially its Chapter 6, for an accessible discussion.) If now the normal form of the powers $\Gamma^n$ can somehow be regarded as known,

$e^{t\Gamma} = \sum_{n=0}^{\infty} \frac{t^n}{n!} \Gamma^n$, with $\Gamma^n = \sum_{\alpha,\beta} c_{n,\alpha,\beta}\, x^\alpha \partial^\beta$,

then a formal solution of the evolution equation (14) is

(20) $F(x, t) = \sum_{n=0}^{\infty} \frac{t^n}{n!} \sum_{\alpha,\beta} c_{n,\alpha,\beta}\, x^\alpha \partial^\beta f(x)$.

It is naturally a nontrivial matter to make analytic sense of (20), which often proves to be a divergent expansion, but the formal solution (20) should at least provide valuable clues as to the kind of special function involved in the analytic solution of the evolution equation (14). ■



Note 5. The heat equation. To illustrate the usefulness of operator calculus, we briefly show how to rederive⁷ formally the solution of the heat equation (15) in the present perspective. In accordance with the Ansatz (17), one should consider the operator exponential

$e^{t\partial^2}$.

We choose⁸ to examine the effect of the exponential-of-the-Laplacian $e^{t\partial^2}$ on the collection of base functions $\{e^{i\omega x}\}$, so that

$e^{t\partial^2} \cdot e^{i\omega x} = \sum_{n \ge 0} \frac{t^n}{n!} \left( \partial^{2n} e^{i\omega x} \right) = e^{-t\omega^2} e^{i\omega x}$.

Then, if $f(x)$ is a Fourier integral,

$f(x) = \int_{-\infty}^{+\infty} e^{i\omega x} \phi(\omega)\, d\omega$,

Equation (20) yields by linearity the formal solution

$F(x, t) = \int_{-\infty}^{+\infty} e^{i\omega x} \left( e^{-t\omega^2} \phi(\omega) \right) d\omega$.

The function $F(\cdot, t)$ thus appears as the Fourier transform of the product of the two functions $e^{-t\omega^2}$ and $\phi(\omega)$. This, by well known properties of the Fourier transform, which exchanges convolutions and ordinary products, leads (formally still) to the celebrated "heat kernel" solution

(21) $F(x, t) = \frac{1}{2\sqrt{\pi t}} \int_{-\infty}^{+\infty} \exp\left( -\frac{(x - y)^2}{4t} \right) f(y)\, dy$.

In this particular case of the heat equation, the normal form problem relative to $(\partial^2)^n$ is trivial. Nonetheless, the derivation above demonstrates the type of usage of operator exponentials, once these can be made sufficiently "explicit". It is one of our goals to develop a combinatorial toolbox for the simplification of such exponentials, which could then serve as a preamble to the analysis of more complicated types of PDEs. ■



7 What follows is extremely classical material (see, e.g., [38, pp. 107-109]), only slightly
rearranged to suit our needs.

8 This choice actually corresponds to an eigenfunction expansion in disguise. For instance, if
$\Gamma$ has known eigenvalues $\{\lambda\}$ with eigenfunctions $\{v(x)\}$, then $e^{t\Gamma} v(x) = e^{t\lambda} v(x)$, from which
solutions can be composed by linearity.

2.4. Combinatorial enumeration. Throughout this paper, we appeal to general
methods of combinatorial analysis relative to the enumeration of labelled objects
and extensively base our discussion on the standard conventions of the book
Analytic Combinatorics [51, Ch. 2]. If $\mathcal{C}$ is a combinatorial class formed of labelled
objects (typically, diagrams), we systematically let $\mathcal{C}_n$ be the subclass of objects of
size $n$, with $C_n$ the corresponding cardinality. The exponential generating function
(EGF) of the class is

        $C(z) := \sum_{n \ge 0} C_n \frac{z^n}{n!} = \sum_{\gamma \in \mathcal{C}} \frac{z^{|\gamma|}}{|\gamma|!}$.

The aim is to construct complex combinatorial classes from simpler
ones. The initial classes include the atomic class $\mathcal{Z}$, which comprises a single
element of size 1 and has EGF $z$, as well as the neutral class $\mathcal{E}$, which consists of a
single element of size 0 and has EGF 1.

Disjoint unions, henceforth written plainly as '+', clearly correspond to sums of
EGFs:

(22)    $\mathcal{C} = \mathcal{A} + \mathcal{B} \implies C(z) = A(z) + B(z)$.

The labelled product $\mathcal{C} = \mathcal{A} \star \mathcal{B}$ of two labelled classes is obtained by taking all
the ordered pairs $(\alpha, \beta)$, with $\alpha \in \mathcal{A}$ and $\beta \in \mathcal{B}$, then relabelling them in all
order-consistent ways so as to obtain a well-labelled pair $(\alpha', \beta')$. We then have the
correspondence

(23)    $\mathcal{C} = \mathcal{A} \star \mathcal{B} \implies C(z) = A(z) \cdot B(z)$.

It is then possible to form the class of all (labelled) sequences ('Seq'), sets ('Set'),
and cycles ('Cyc') with components in $\mathcal{A}$, the corresponding EGFs being given by
the following dictionary:

        $\mathcal{C} = \mathrm{Seq}(\mathcal{A}) \implies C(z) = \frac{1}{1 - A(z)}$
(24)    $\mathcal{C} = \mathrm{Set}(\mathcal{A}) \implies C(z) = \exp(A(z))$
        $\mathcal{C} = \mathrm{Cyc}(\mathcal{A}) \implies C(z) = \log \frac{1}{1 - A(z)}$.

These basic constructions suffice to transcribe the normal form problem in simpler
cases such as (X + D) in Section 3 and (XD) in Section 4.
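
As an informal check of the dictionary (22)-(24), the following sketch manipulates truncated
EGFs with exact rational coefficients. It is only an illustration: the helper names (`seq`,
`set_`, `cyc`, `counts`) and the truncation order `N` are our own choices, not notation from
the text, and each construction is assumed to be applied to a class with no object of size 0.

```python
from fractions import Fraction
from math import factorial

N = 8  # truncation order (hypothetical choice): series are kept modulo z^N


def seq(a):
    """EGF of Seq(A) = 1/(1 - A(z)), using C = 1 + A*C; assumes a[0] == 0."""
    c = [Fraction(0)] * N
    c[0] = Fraction(1)
    for n in range(1, N):
        c[n] = sum(a[k] * c[n - k] for k in range(1, n + 1))
    return c


def set_(a):
    """EGF of Set(A) = exp(A(z)), using C' = A'*C; assumes a[0] == 0."""
    c = [Fraction(0)] * N
    c[0] = Fraction(1)
    for n in range(1, N):
        c[n] = sum(k * a[k] * c[n - k] for k in range(1, n + 1)) / n
    return c


def cyc(a):
    """EGF of Cyc(A) = log 1/(1 - A(z)), using L' = A' * Seq(A)."""
    s = seq(a)
    c = [Fraction(0)] * N
    for n in range(1, N):
        c[n] = sum(k * a[k] * s[n - k] for k in range(1, n + 1)) / n
    return c


def counts(c):
    """Turn EGF coefficients c_n into counting numbers n! * c_n."""
    return [int(c[n] * factorial(n)) for n in range(N)]


# The atomic class Z has EGF z.
Z = [Fraction(0), Fraction(1)] + [Fraction(0)] * (N - 2)

# Set(Z + Cyc_2(Z)): argument z + z^2/2 (involutions).
involution_arg = [Fraction(0), Fraction(1), Fraction(1, 2)] + [Fraction(0)] * (N - 3)

# Set(Set_{>=1}(Z)): argument e^z - 1 (set partitions).
nonempty_sets = [Fraction(0)] + [Fraction(1, factorial(n)) for n in range(1, N)]

print(counts(seq(Z)))                # permutations viewed as sequences
print(counts(cyc(Z)))                # cyclic arrangements
print(counts(set_(involution_arg)))  # involutions
print(counts(set_(nonempty_sets)))   # set partitions
```

Working with exact fractions avoids any rounding when the coefficients are multiplied back by $n!$.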

Finally, we shall need the modified "boxed product" construction, $\mathcal{C} = (\mathcal{A}^{\square} \star \mathcal{B})$,
which corresponds to the subset of ordered pairs $(\alpha, \beta) \in (\mathcal{A} \star \mathcal{B})$, such that the
smallest label is constrained to belong to the $\alpha$ component; see Greene's thesis [52],
as well as the accounts in [7, Ch. 7] and [51, §II.6.3]. The translation rule from
constructions to EGFs is

(25)    $\mathcal{C} = (\mathcal{A}^{\square} \star \mathcal{B}) \implies C(z) = \int_0^z \left(\frac{d}{dt} A(t)\right) \cdot B(t)\, dt$.

This covers in particular the min-rooting operation, which attaches an external
atom to a $\mathcal{B}$-structure and assigns to it the smallest label: the construction is
simply $(\mathcal{Z}^{\square} \star \mathcal{B})$ and it corresponds to an integration operator, $\int_0^z B(t)\, dt$. The same
translation applies to the dual boxed product $\mathcal{C} = (\mathcal{A}^{\blacksquare} \star \mathcal{B})$, where it is now the
largest label that is constrained to belong to the $\mathcal{A}$-component. We shall make
much use of these constructions, which correspond to a decomposition according to
the first or last gate of a diagram: they are used typically in Section 5 relative to
$(X^2 + D^2)$, and Section 6 relative to $(X^2 D)$.

FIGURE 6. A particular (X + D)-diagram.

All the rules above extend to multivariate generating functions: these contain
extra parameters, which can keep track of various additive characteristics of
structures [51, Ch. III].

In the next sections, we are precisely going to make use of the dictionary formed
by (22), (23), (24), and (25) in order to enumerate various classes of diagrams
associated (via Theorem 1) with the normal ordering of terms in the operators X, D.



3. Linear forms (X + D), involutions, and generalizations 

This section is dedicated to the normal ordering of expressions of the form
$(a(X) + D)^n$ and $(a(D) + X)^n$, with $a(\cdot)$ a polynomial, starting with the easier
case $(X + D)^n$. Thus, the schema is that of a base form in X and D, which
is linear in at least one of the operators X, D. The combinatorial models turn out
to be special coloured permutations (involutions and generalizations), for which the
introduction of diagrams easily leads to fully explicit forms.

3.1. The basic linear case (X + D). Linear forms in the operators X and D,
namely, operators of the form

        $\Gamma = \alpha X + \beta D$,

can serve to illustrate the usefulness of diagrams with minimal apparatus. In accordance
with the developments of Section 2, we are talking here of diagrams based
on two types of gates: a gate X has no input and one output; a gate D has one
input and no output.

[Diagram: the two gates $\alpha X$ (no input, one output) and $\beta D$ (one input, no output).]

By inspection of all the possible ways of assembling gates, it is immediately seen
that a diagram must be comprised of three types of components: isolated X-gates,
isolated D-gates, and pairs DX, where a D is hooked to an earlier arrived X. In
the labelled case, the isolated X and D may receive arbitrary labels, while the
"saturated" DX pairs are only constrained by the fact that the label of the X is
smaller than the label of the D; see Figure 6.

Combinatorially, the class of all diagrams is thus specified by

        $\mathcal{G} = \mathrm{Set}(\alpha \mathcal{Z} + \beta \mathcal{Z} + \alpha\beta\, \mathrm{Set}_2(\mathcal{Z}))$,

when the multiplicative weights $\alpha$, $\beta$ are taken into account. Here, the $\mathrm{Set}_2$
construction⁹ describes an unordered pair of two labels whose arrangement is
immaterial: the smaller label is necessarily attached to an X-gate, the larger one to
a D-gate; see Figure 6. The generating function of all weighted diagrams is accordingly

(26)    $G(z) = e^{(\alpha + \beta) z + \alpha\beta z^2/2}$.

An equivalent way of phrasing this result is as an equality,

(27)    $\mathfrak{N}\left(e^{z(\alpha X + \beta D)}\right) = e^{\alpha\beta z^2/2} \cdot e^{\alpha z X} \cdot e^{\beta z D}$,

where the right-hand side is to be interpreted as a standard expansion in the
commuting indeterminates X, D. A straight expansion then yields an explicit form
of (27),

        $\sum_{k,\ell,m \ge 0} \frac{z^{2k}}{2^k\, k!} (\alpha\beta)^k \cdot \frac{z^\ell}{\ell!} \alpha^\ell X^\ell \cdot \frac{z^m}{m!} \beta^m D^m$,

which, after collection of the coefficient of $z^n$, can be summarized as follows¹⁰.

Proposition 2. The normal form of $(\alpha X + \beta D)^n$ satisfies

        $\mathfrak{N}\left((\alpha X + \beta D)^n\right) = \sum_{\ell, m} \tau^{(n)}_{\ell,m}\, X^\ell D^m$,

where, for $n - \ell - m$ odd, the coefficients $\tau^{(n)}_{\ell,m}$ are 0, while, for $n - \ell - m$ even, they
satisfy

        $\tau^{(n)}_{\ell,m} = \frac{n!}{2^{(n-\ell-m)/2}\, ((n-\ell-m)/2)!\; \ell!\; m!}\; \alpha^{(n+\ell-m)/2}\, \beta^{(n-\ell+m)/2}$.
In combinatorics, a permutation $\sigma$ containing only cycles of sizes 1 and 2 is
known as an involution (it satisfies $\sigma^2 = 1$). The specification and exponential
generating functions are accordingly

        $\mathcal{I} = \mathrm{Set}(\mathcal{Z} + \mathrm{Cyc}_2(\mathcal{Z}))$,    $I(z) = e^{z + z^2/2}$;

see, e.g., [51, p. 122]. The numbers $\tau^{(n)}_{\ell,m}$ therefore enumerate coloured involutions,
where singleton cycles can receive any one of two colours (corresponding either to
an X or a D). In other words: the natural combinatorial model for the normal
ordering of $(X + D)^n$ is that of (bicoloured) involutions.
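
The statement of Proposition 2 can be tested mechanically. The sketch below is our own
illustration, not the authors' method: it stores a normally ordered polynomial as a dictionary
`{(i, j): c}` standing for $c\, X^i D^j$, multiplies two such polynomials with the classical
reordering rule $D^b X^c = \sum_k \binom{b}{k} \frac{c!}{(c-k)!} X^{c-k} D^{b-k}$ (a consequence of $[D, X] = 1$), and compares
the result with the closed form, written here with the coefficient name $\tau$. The function
names `op_mul`, `op_power`, `tau`, and `proposition2` are hypothetical.

```python
from math import comb, factorial, perm


def op_mul(p, q):
    """Product of two normally ordered polynomials {(i, j): c} ~ c * X^i D^j."""
    out = {}
    for (a, b), c1 in p.items():
        for (c, d), c2 in q.items():
            # Move D^b to the right of X^c, creating k contractions.
            for k in range(min(b, c) + 1):
                key = (a + c - k, b + d - k)
                out[key] = out.get(key, 0) + c1 * c2 * comb(b, k) * perm(c, k)
    return out


def op_power(p, n):
    """Normal form of the n-th power of p."""
    out = {(0, 0): 1}
    for _ in range(n):
        out = op_mul(out, p)
    return out


def tau(n, l, m, alpha, beta):
    """Closed-form coefficient of X^l D^m in the normal form of (alpha X + beta D)^n."""
    r = n - l - m
    if r < 0 or r % 2:
        return 0
    k = r // 2  # number of saturated DX pairs
    return (factorial(n) * alpha ** (k + l) * beta ** (k + m)
            // (2 ** k * factorial(k) * factorial(l) * factorial(m)))


def proposition2(n, alpha, beta):
    """All nonzero closed-form coefficients, as a dictionary {(l, m): value}."""
    table = {}
    for l in range(n + 1):
        for m in range(n + 1 - l):
            t = tau(n, l, m, alpha, beta)
            if t:
                table[(l, m)] = t
    return table


# Integer weights (arbitrary test values) keep all arithmetic exact.
alpha, beta = 2, 3
for n in range(7):
    assert op_power({(1, 0): alpha, (0, 1): beta}, n) == proposition2(n, alpha, beta)
```

Using two distinct integer weights, rather than $\alpha = \beta = 1$, also exercises the exponents of $\alpha$ and $\beta$.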

The classical result expressed by Proposition 2 is usually derived in the context
of Lie groups by means of the Baker-Campbell-Hausdorff formula [551 IM], since, 
in this case, nested Lie brackets of higher order vanish. Indeed, it is known in this 
theory that, if the commutator $C = [A, B]$ commutes with both A and B, one has
the general identity (see [58, p. 463] and [64, p. 64]):

        $e^{z(A+B)} = e^{zA} \cdot e^{[B,A] z^2/2} \cdot e^{zB}$.

9 In this particular case, a $\mathrm{Cyc}_2$ construction can equivalently be used, given the obvious
combinatorial isomorphism $\mathrm{Set}_2(\mathcal{Z}) \cong \mathrm{Cyc}_2(\mathcal{Z})$.

10 Proposition 2 is a classical result: it is for instance derived by means of operator calculus
methods in Wilcox's paper [133, Eq. (10.43)], published in 1967.

11 Formulae of this sort are sometimes known as "disentanglement formulae" [29].

Note 6. Solution of a special PDE by normal ordering of $(X+D)^n$. The partial differential
equation of interest is

(28)    $\frac{\partial F}{\partial t} = \frac{\partial F}{\partial x} + xF$,

with initial value condition $F(x, 0) = f(x)$. This is of the form considered when discussing
operator exponentials in Note 4, p. 12, here with $\Gamma = (X + D)$. The operator solution
$e^{t\Gamma} f$, under the normal ordering provided by Equation (27) above, becomes

        $e^{t\Gamma} f = e^{t^2/2} \cdot e^{tx} \cdot e^{tD} f$.

We know (from the earlier Note 3, p. 11) that the exponential of a derivative is a shift,
$e^{tD} f(x) = f(x + t)$, so that the solution to (28) is the fully explicit

(29)    $F(x,t) = e^{t^2/2 + xt} f(x + t)$,

whose validity is easily checked directly.

No claim is made that the PDE (28) is hard to solve, and, indeed, it succumbs easily
to the method of characteristics, which is generally applicable to linear and quasilinear
PDEs [24, §1.15]. It is nonetheless instructive to observe the way the reduction to normal
ordering (with the right factor $e^{tD}$) could automatically put us on the tracks of a general
solution. ■
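
The direct check of the solution $F(x,t) = e^{t^2/2 + xt} f(x+t)$ can also be done numerically. The sketch
below is a rough illustration under stated assumptions: the initial datum `f` (here a sine),
the sample points, and the finite-difference step `h` are arbitrary choices of ours, and
central differences only approximate the derivatives.

```python
import math


def f(x):
    """Hypothetical initial datum; any smooth function would do."""
    return math.sin(x)


def F(x, t):
    """Candidate solution e^{t^2/2 + x t} f(x + t) of dF/dt = dF/dx + x F."""
    return math.exp(t * t / 2 + x * t) * f(x + t)


def residual(x, t, h=1e-5):
    """Central-difference estimate of dF/dt - dF/dx - x*F at (x, t)."""
    dF_dt = (F(x, t + h) - F(x, t - h)) / (2 * h)
    dF_dx = (F(x + h, t) - F(x - h, t)) / (2 * h)
    return dF_dt - dF_dx - x * F(x, t)


for x, t in [(0.3, 0.7), (-1.2, 0.4), (2.0, 0.1)]:
    print(x, t, residual(x, t))
```

The printed residuals should be small compared with the size of $F$ itself; they are not expected to vanish exactly because of discretisation and rounding errors.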



3.2. Generalizations to $(X + D^r)$ and $(X^r + D)$. We next examine the case
of the operator $(X + D^2)$. According to the main theorem, one should consider
diagrams built out of two types of gates: $D^2$-gates have two inputs and no output;
X-gates have, as usual, one output and no input.

[Diagram: the X-gate and the $D^2$-gate.]

It is then easily realized that the only possibilities for connected diagrams are given
by the following list:

(30)    [Diagrams of the five connected components, of types $D^2$, $X$, $D^2 \square X$, $D^2 X \square$, $D^2 X^2$.]

For instance, $D^2 X^2$ represents a connected component that is "saturated", in the
sense that the two inputs of a $D^2$-gate are connected to the (earlier arrived) outputs
of two X-gates; similarly, $D^2 X \square$ depicts a component for which the output of an
X-gate is plugged into the first input of a $D^2$-gate, while the second input remains
free; and so on.

We can then look at the balance of inputs and outputs corresponding to each of
the components of the list (30). For instance, a $D^2 X \square$ component has one free input (a
dangling D) and no output, so that it is equivalent to a D, in terms of the balance
between number of inputs and number of outputs. Proceeding systematically, we
determine the following table for all the possible types of connected components:

(31)    type    :  $D^2$    $X$    $D^2 \square X$    $D^2 X \square$    $D^2 X^2$
        size    :  1        1      2                  2                  3
        balance :  $D^2$    $X$    $D$                $D$                1.

Next, we should examine the number of ways of placing labels on connected
components. In the case of a $D^2 \square X$ or $D^2 X \square$, the size equals 2, but the
$D^2$-gate is necessarily associated with the largest label; thus, any such component is a
$\mathrm{Set}_2(\mathcal{Z})$, itself combinatorially equivalent to a $\mathrm{Cyc}_2(\mathcal{Z})$. In the case of a saturated
component $D^2 X^2$, two possible orderings are to be considered, since the first input
of $D^2$ is associated with either the earlier arrived X or with the later X; in this
case, the component turns out to be equivalent to a $\mathrm{Cyc}_3(\mathcal{Z})$. Thus, using now,
as standard commuting formal variables, $u$ to mark components with a dangling
output (X) and $v$ to mark components with a dangling input (D), we have, for the
various components, the specifications:

(32)    type          :  $D^2$             $X$              $D^2 \square X$                 $D^2 X \square$                 $D^2 X^2$
        specification :  $v^2 \mathcal{Z}$    $u \mathcal{Z}$    $v\, \mathrm{Cyc}_2(\mathcal{Z})$    $v\, \mathrm{Cyc}_2(\mathcal{Z})$    $\mathrm{Cyc}_3(\mathcal{Z})$.

There results that the class of all diagrams is described by

        $\mathcal{G} = \mathrm{Set}\left(v^2 \mathcal{Z} + u \mathcal{Z} + v\, \mathrm{Cyc}_2(\mathcal{Z}) + v\, \mathrm{Cyc}_2(\mathcal{Z}) + \mathrm{Cyc}_3(\mathcal{Z})\right)$,

with corresponding EGF

(33)    $G(z; u, v) = \exp\left(v^2 z + u z + v z^2/2 + v z^2/2 + z^3/3\right)$.

In other words, the model is now that of (multicoloured) permutations, all of whose
cycles are of length at most three, where, in addition, singletons and doubletons can
be of any of two colours.

The very same reasoning applies to the normal ordering of $(X^2 + D)^n$. The
diagrams are isomorphic to the earlier ones, with inputs and outputs being exchanged
and time being reversed. That is, the roles of X and D in (33) are simply to be
exchanged; this is a special case of the general duality discussed in Note 2, p. 11.
In summary:

Proposition 3. The normal orderings corresponding to the base operators $(D^2 + X)$
and $(X^2 + D)$ satisfy

(34)    $\mathfrak{N}\left(e^{z(D^2 + X)}\right) = e^{z^3/3 + zX} \cdot e^{z^2 D + z D^2}$,
        $\mathfrak{N}\left(e^{z(X^2 + D)}\right) = e^{z^3/3 + z^2 X + z X^2} \cdot e^{zD}$.


Finally, the explicit normal forms associated with (D + a(X)) and (X + a(D)),
for $a(\cdot)$ a polynomial, are accessible via a combinatorial calculus that suitably
extends (31) and (32): the combinatorial model now involves permutations with
cycles of length at most r. We leave it as an exercise to the reader to work out
details and state¹²:

12 Mikhailov [100] has a simple proof based on operator algebra, and he refers to earlier
works of Witschel (1975) and Yamazaki (1952).

Proposition 4. The normal orderings corresponding to the base operators (a(D) + X)
and (a(X) + D) are given by

(35)    $\mathfrak{N}\left(e^{z(a(D) + X)}\right) = e^{zX} \cdot \exp\left(\int_0^z a(D + w)\, dw\right)$,
        $\mathfrak{N}\left(e^{z(a(X) + D)}\right) = \exp\left(\int_0^z a(X + w)\, dw\right) \cdot e^{zD}$.

In the pure binomial case of $(D^r + X)$ and $(X^r + D)$, the normal forms further
simplify¹³ as

        $\mathfrak{N}\left(e^{z(D^r + X)}\right) = e^{zX} \cdot \exp\left(\frac{1}{r+1}\left[(D + z)^{r+1} - D^{r+1}\right]\right)$,
        $\mathfrak{N}\left(e^{z(X^r + D)}\right) = \exp\left(\frac{1}{r+1}\left[(X + z)^{r+1} - X^{r+1}\right]\right) \cdot e^{zD}$,

which gives back (27) and (34), when r = 1, 2.



Note 7. Another PDE. The usual consequences for PDEs also hold, with the further
simplification that $e^{tD}$ is a shift and that, for the quadratic case at least, the exponential
of the Laplacian can be made explicit in the Fourier basis of complex exponentials. For
instance, the general PDE schema

        $\frac{\partial F}{\partial t} = \frac{\partial F}{\partial x} + a(x) F$,

admits the solution

        $F(x,t) = e^{Q(x,t)} f(x + t)$,    $Q(x,t) = \int_0^t a(x + w)\, dw$.

In particular, $a(x) = x^2$ corresponds to $Q(x,t) = t^3/3 + t x^2 + t^2 x$. ■



4. The special quadratic form (XD), set partitions, and product forms 

This section is dedicated to the normal ordering of the powers $(XD)^n$, a problem
which has been recognized for a long time to be tightly coupled to set partitions and
Stirling numbers of the second kind. In Appendix A, we summarize the main results
contained in the rather remarkable thesis memoir of Heinrich Scherk, defended
in 1823: the reduction of $(XD)^n$ figures there explicitly! We also consider, in
this section, the related quadratic form XD + g(X + D). Finally, we show that the
combinatorial approach advocated here extends rather easily to powers of operators
such as $X^2 D^2$, $X^3 D^3$, and so on, which relates to several combinatorial enumeration
problems of independent interest.

4.1. The form (XD) and set partitions. By the general isomorphism theorem,
the operator (XD) corresponds to a gate with one input and one output.

[Diagram: the XD-gate, with one input and one output.]

With these, we can form chains where the input of an XD-gate is connected to
the output of a previously arrived XD-gate. When time stamps are put on the
vertices of the gates, a connected component becomes an increasing linear graph
(see, e.g., [51, p. 99]), in the sense that vertices are linearly arranged and labelled in
increasing order, with the additional condition that size needs to be at least one: see
Figure 7. Note that edges between adjacent elements in a connected component are
redundant and hence each chain can be represented by the nonempty set of its labels.
Here is a particular representation:

[Diagram: the chains of a diagram, each represented by its set of labels.]

FIGURE 7. A particular diagram associated with (XD).

13 These formulae have also been found independently by Karol Penson (unpublished, 2004).

Under this form one recognizes the classical structure of set partitions, where a 
partition of a set is a subdivision of the elements of the set into indistinguishable 
non-empty classes, also known as "blocks" [351 \ST[ [ST] . (Here the components have 
been presented in increasing order of their leading elements.) 

Components can be equivalently regarded as non-empty unordered collections
of labels (non-empty "urns" in the terminology of [51, p. 99]); that is, they are
specified by $\mathcal{K} = \mathrm{Set}_{\ge 1}(\mathcal{Z})$. An arbitrary graph built out of XD-gates is then an
unordered collection of such components, so that the corresponding class is $\mathrm{Set}(\mathcal{K})$.
With $z$ marking the size of a graph (its number of gates, equivalently, vertices) and $u$
marking the number of connected components (here equal to the number of inputs
and to the number of outputs), we thus have for diagrams the specification

        $\mathcal{G} = \mathrm{Set}\left(u\, \mathrm{Set}_{\ge 1}(\mathcal{Z})\right)$,

hence the corresponding generating function

(36)    $G(z,u) = e^{u(e^z - 1)}$.

The generating functions solve the corresponding enumeration problems. By an
expansion of (36) taken at $u = 1$, the total number $\varpi_n$ of partitions of a set of
cardinality $n$ satisfies the "Dobiński relation"

(37)    $\varpi_n = n!\, [z^n]\, e^{e^z - 1} = \frac{1}{e} \sum_{\ell \ge 0} \frac{\ell^n}{\ell!}$,

a quantity known in combinatorics as a Bell number [25]. The number of partitions
of $n$ elements into $k$ classes is the Stirling number of the second kind, nowadays
usually denoted by ${n \brace k}$, whose value is [23, 51, 61]

(38)    ${n \brace k} = n!\, [z^n u^k]\, e^{u(e^z - 1)} = \frac{n!}{k!} [z^n] (e^z - 1)^k = \frac{1}{k!} \sum_{j=0}^{k} \binom{k}{j} (-1)^{k-j} j^n$,

as can be immediately verified from a series expansion of (37).
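
The explicit formula (38) and the Dobiński relation (37) lend themselves to a quick sanity
check. The sketch below is an illustration of ours: `stirling2` implements the alternating
sum of (38), `partitions` is a naive recursive enumerator of set partitions, and `dobinski`
truncates the infinite series after a number of terms that we chose arbitrarily (60), so it
returns a floating-point approximation only.

```python
from math import comb, e, factorial


def stirling2(n, k):
    """Stirling partition number via (1/k!) * sum_j C(k,j) (-1)^(k-j) j^n."""
    total = sum(comb(k, j) * (-1) ** (k - j) * j ** n for j in range(k + 1))
    return total // factorial(k)


def partitions(elements):
    """Yield every set partition of a list, as a list of blocks."""
    if not elements:
        yield []
        return
    first, rest = elements[0], elements[1:]
    for p in partitions(rest):
        yield [[first]] + p                       # `first` alone in a new block
        for i in range(len(p)):                   # or inserted in an existing block
            yield p[:i] + [[first] + p[i]] + p[i + 1:]


def dobinski(n, terms=60):
    """Truncated Dobinski series (1/e) * sum_l l^n / l!, as a float."""
    return sum(l ** n / factorial(l) for l in range(terms)) / e


for n in range(7):
    by_blocks = [0] * (n + 1)
    for p in partitions(list(range(1, n + 1))):
        by_blocks[len(p)] += 1
    assert by_blocks == [stirling2(n, k) for k in range(n + 1)]
    assert round(dobinski(n)) == sum(by_blocks)

print([sum(stirling2(n, k) for k in range(n + 1)) for n in range(8)])
```

The brute-force enumeration grows like the Bell numbers themselves, so it is only practical for small $n$.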

Back to the normal ordering problem, each connected component of a diagram,
i.e., each chain of XD-gates, has one input and one output, so that the balance of a
single component is of the form XD, the balance of $k$ components being accordingly
$X^k D^k$. We then have:

Proposition 5. The normal ordering associated with $e^{z(XD)}$ involves the Stirling
partition numbers:

(39)    $\mathfrak{N}\left(e^{z(XD)}\right) = \left. e^{u(e^z - 1)} \right|_{u^k \to X^k D^k}$,
        $\mathfrak{N}\left((XD)^n\right) = \sum_{k} {n \brace k} X^k D^k$,
        $\mathfrak{N}\left(e^{z(XD)}\right) = \sum_{k \ge 0} \frac{1}{k!} (e^z - 1)^k X^k D^k$.
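
The appearance of Stirling partition numbers in the normal form of $(XD)^n$ can be observed
directly on small cases. The following sketch is our own illustration: it reorders products
with the rule $D^b X^c = \sum_k \binom{b}{k} \frac{c!}{(c-k)!} X^{c-k} D^{b-k}$ and compares the coefficients of $X^k D^k$ with Stirling
numbers computed from the usual recurrence ${n \brace k} = k {n-1 \brace k} + {n-1 \brace k-1}$. Function names are hypothetical.

```python
from math import comb, perm


def op_mul(p, q):
    """Product of normally ordered polynomials {(i, j): c} ~ c * X^i D^j."""
    out = {}
    for (a, b), c1 in p.items():
        for (c, d), c2 in q.items():
            for k in range(min(b, c) + 1):
                key = (a + c - k, b + d - k)
                out[key] = out.get(key, 0) + c1 * c2 * comb(b, k) * perm(c, k)
    return out


def op_power(p, n):
    out = {(0, 0): 1}
    for _ in range(n):
        out = op_mul(out, p)
    return out


def stirling2_row(n):
    """Row n of the Stirling partition numbers, built from the recurrence."""
    row = [1]  # row 0
    for m in range(1, n + 1):
        prev = row + [0]
        row = [0] + [k * prev[k] + prev[k - 1] for k in range(1, m + 1)]
    return row


for n in range(1, 8):
    normal_form = op_power({(1, 1): 1}, n)
    # Only "diagonal" monomials X^k D^k may occur.
    assert all(i == j for (i, j) in normal_form)
    assert [normal_form.get((k, k), 0) for k in range(n + 1)] == stirling2_row(n)

print(op_power({(1, 1): 1}, 4))
```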



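Note 8. Yet another PDE. Proceeding as before, we obtain from (39) that the PDE

        $\frac{\partial F}{\partial t} = x \frac{\partial F}{\partial x}$

with initial condition $F(x, 0) = f(x)$ possesses the solution

        $F(x,t) = \sum_{k \ge 0} \frac{1}{k!} (e^t - 1)^k x^k \partial^k f(x) = f\left((e^t - 1)x + x\right) = f(x e^t)$,

where the middle equality is Taylor's formula, and whose a posteriori verification is immediate.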



Note 9. History. The origin of the Stirling partition numbers ${n \brace k}$ and their cognates, the
Stirling cycle numbers ${n \brack k}$, lies in eighteenth century calculus. The classical way of defining
them is as the coefficients expressing, in the polynomial algebra $\mathbb{C}[x]$, the change of basis
between the canonical basis $x^n$ and the factorial basis $x^{\underline{n}} = x(x-1)\cdots(x-n+1)$ or its
trivial variant $x^{\overline{n}} = x(x+1)\cdots(x+n-1)$. Indeed, one has [61, pp. 248-249]:

(40)    $x^n = \sum_{k} {n \brace k} x^{\underline{k}}$,    $x^{\overline{n}} = \sum_{k} {n \brack k} x^k$.

(In a different circle of ideas, Stirling numbers are known to probabilists as a way of
relating factorial moments and standard power moments [31, p. 47].) Upon examining
the effect on coefficients of power series, it is then seen that Stirling coefficients serve to
express the connection between powers $(x \partial_x)^n$ and standard derivatives, via $x^k \partial_x^k$, which
is exactly what we derived by elementary combinatorics in (39). An equivalent operator
formulation appears as an exercise¹⁴ in Comtet's book [25, Ex. 2, p. 220]. (Some of these
properties were already familiar to Scherk in 1823; cf. Appendix A, p. 72.)

FIGURE 8. A sample diagram associated with (XD + g(X + D)).



In the context of quantum physics, the combinatorial connections between the normal
ordering of $(a^\dagger a)^n$ and Stirling numbers have been recognized early: see, for instance, the
papers of Wilcox [133, p. 978], for an algebraic perspective, and especially Katriel [77], for
the combinatorial connection. (See also references therein to earlier works by Schwinger
and others.) This thread has given rise to a large body of subsequent literature. In
particular, Bender, Brody, and Meister explicitly discuss Bell numbers in the context of
diagrams in [5], although their approach differs substantially from ours, as they focus on
interpretations of the schema e^^e^ x ' . The relations between set partitions, diagrams 
and Stirling numbers are used as a lead example by Baez and Dolan [3] to illustrate the
process of "decategorifying" (!) the creation-annihilation theory. We examine below, in
Subsection 4.2, further interesting extensions of the combinatorial approach that are due
to Blasiak et al. ■

Note 10. Normal forms relative to (XD + g(X + D)). We briefly discuss this case, which
is treated by Wilcox [133] and Louisell [89] by means of operator calculus, as it illustrates
the versatility of the combinatorial method. The gates are now of the three types X, D
and XD.

[Diagram: the three gates gX, XD, and gD.]

The connected components are similar to those relative to (XD), with, in addition, the
possibility that a line graph can be "capped" with a D, or "cupped" with an X, or both
capped and cupped; they are consequently of four possible types. Hence, one can write
directly the specification of graphs as

        $\mathcal{G} = \mathrm{Set}\left((uv + ug + vg)\, \mathrm{Set}_{\ge 1}(\mathcal{Z}) + g^2\, \mathrm{Set}_{\ge 2}(\mathcal{Z})\right)$,

where $u$ and $v$ mark inputs and outputs, respectively. Since $\mathrm{Set}_{\ge 1}(\mathcal{Z})$ and $\mathrm{Set}_{\ge 2}(\mathcal{Z})$ have
EGFs $e^z - 1$ and $e^z - 1 - z$, the corresponding multivariate EGF is then

        $G(z; u, v) = \exp\left((uv + ug + vg)(e^z - 1) + g^2 (e^z - 1 - z)\right)$.

(This case prefigures the planted-tree construction of Subsection 6.3, p. 39.)



14 This interesting exercise also contains the reduction of $(X^{a+1} D)^n$ and its connection with
$\{(1 - at)^{-1/a} - 1\}$ [with the minor typo of a missing exponent of $n$], which we shall encounter later
in Section 6; see also Riordan [109, §6.6] for related operational calculus derivations.

4.2. The product form $(X^2 D^2)$. This subsection and the next one serve to revisit
the combinatorial works of Blasiak, Horzela, Penson, Solomon, and coauthors [TOl 
IT51 IT5I ITTJl |9"9"] . See also Schork's synthetic study |114j , which furthermore includes 
some q-analogues. To avoid cumbersome notations, we start with a discussion of
the operator $X^2 D^2$. What is at stake is understanding the structure of reductions
such as

        $(X^2 D^2) = X^2 D^2$
(41)    $(X^2 D^2)^2 = 2X^2 D^2 + 4X^3 D^3 + X^4 D^4$
        $(X^2 D^2)^3 = 4X^2 D^2 + 32X^3 D^3 + 38X^4 D^4 + 12X^5 D^5 + X^6 D^6$,

where the sums of the coefficients form a sequence $\varpi_n^{2,2}$, which starts as

(42)    $(\varpi_n^{2,2}) = 1, 7, 87, 1657, 43833, 1515903, 65766991, 3473600465, \ldots$.

These numbers appear as Sequence OEIS A020556 in Sloane's Online Encyclopedia
of Integer Sequences [116], henceforth abbreviated as "OEIS". The triangle of
coefficients in the expansion of powers of $(X^2 D^2)$ as in (41) is Sequence OEIS A078739.
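
The reductions (41) and the first terms of (42) can be regenerated mechanically. The sketch
below is an illustration of ours rather than the method of the cited works: it repeatedly
multiplies by $X^2 D^2$ using the reordering rule $D^b X^c = \sum_k \binom{b}{k} \frac{c!}{(c-k)!} X^{c-k} D^{b-k}$, with normally ordered
polynomials stored as dictionaries `{(i, j): c}`. The names `op_mul`, `op_power`, and `row` are hypothetical.

```python
from math import comb, perm


def op_mul(p, q):
    """Product of normally ordered polynomials {(i, j): c} ~ c * X^i D^j."""
    out = {}
    for (a, b), c1 in p.items():
        for (c, d), c2 in q.items():
            for k in range(min(b, c) + 1):
                key = (a + c - k, b + d - k)
                out[key] = out.get(key, 0) + c1 * c2 * comb(b, k) * perm(c, k)
    return out


def op_power(p, n):
    out = {(0, 0): 1}
    for _ in range(n):
        out = op_mul(out, p)
    return out


def row(n):
    """Coefficients of X^k D^k, k = 2..2n, in the normal form of (X^2 D^2)^n."""
    normal_form = op_power({(2, 2): 1}, n)
    return [normal_form.get((k, k), 0) for k in range(2, 2 * n + 1)]


for n in range(1, 5):
    print(n, row(n), sum(row(n)))
```

The rows for $n = 1, 2, 3$ should reproduce the coefficients displayed in (41), and the row sums the first entries of (42).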

By the combinatorial isomorphism, we now have one kind of gate, namely, $X^2 D^2$;
that is, a gate has two input edges and two output edges. Such a gate thus picks up
two, one or none of the previously existing outgoing edges, "prolonging" the hooked
edges and generating respectively none, one or two new links; the local balance of
the number of links (edges) is clearly null. In other words, we can view a gate
as simply "propagating" links. It is then of advantage to align edges vertically
and represent an $X^2 D^2$-gate by a horizontal vector: the orientation conventionally
serves to distinguish the first input from the second input; these inputs can be
conveniently tagged by a $\oplus$ and a $\ominus$ sign, respectively, as in the following diagram:

(43)    [Diagram: an $X^2 D^2$-gate drawn as a horizontal vector, with inputs tagged $\oplus$ and $\ominus$.]

When such gates are stacked on top of one another, they then give rise to piles of
vectors that can be viewed as (weird!) "scaffoldings"; see Figure 9. (This structure
is loosely evocative of Viennot's theory of "heaps of pieces" [129].)

FIGURE 10. A graphical rendering of the isomorphism expressed
by (45): on the left appear "relaxed" scaffoldings; on the right the
product form that isolates unused inputs and a proper scaffolding.

Bilabelled structures. Let $\mathcal{S}_{n,k}$ be the collection of all diagrams comprised of $n$
gates of type $X^2 D^2$ that have $k$ inputs and let ${n \brace k}_{2,2} = S_{n,k}$ be the corresponding
cardinality. We consider the class $\mathcal{S}_k = \bigcup_n \mathcal{S}_{n,k}$, which it is now our goal to
enumerate. This can be achieved by considering bilabelled objects, that is, objects
carrying two kinds of labels. In the case at hand, we consider the extended class $\widehat{\mathcal{S}}_k$,
where inputs bear independent labels, $1', \ldots, k'$, which are indicated by a prime
("primed"), in order to distinguish them from the usual gate/vertex labels. The
labelled product extends trivially: distribute independently both types of labels.
It translates straightforwardly as a product of biexponential generating functions,
of the form

(44)    $\sum_{n,k} f_{n,k}\, \frac{\zeta^k}{k!}\, \frac{z^n}{n!}$,

where the powers of $z$ and $\zeta$ record, respectively, the number of standard and primed
labels.

Let $\widehat{\mathcal{S}}_{n,k}$ be the class of diagrams of size $n$ with $k$ labelled inputs. In terms of
cardinalities, we have $\widehat{S}_{n,k} = k!\, S_{n,k}$. Indeed, in a standard, singly labelled diagram
of $\mathcal{S}_{n,k}$, all inputs are distinguishable, say, according to the time they are first used,
and by considering the first input, tagged "$\oplus$", before the second one, tagged "$\ominus$"
(see Figure 9); there are thus exactly $k!$ ways to superimpose an input labelling on
a diagram. We then claim the identity

(45)    $\mathcal{Q} = \mathrm{Set}(\mathcal{Z}') \star \bigcup_k \widehat{\mathcal{S}}_k$.

Here, $\mathcal{Z}'$ represents an atom that carries a primed label, but no standard
("unprimed") label; the class $\mathcal{Q}$ is a relaxed version of $\bigcup_k \widehat{\mathcal{S}}_k$ in which extra
(primed-labelled) inputs are allowed; the labelled product '$\star$' is, in accordance with our
previous discussion, taken to distribute both kinds of labels.

In order to justify (45), observe that the left hand side describes all possible
sequences of choices of pairs of distinct prime-tagged elements amongst a set of
cardinality $k$, with the possibility of some of the elements to be unused; the factor
$\mathrm{Set}(\mathcal{Z}')$ in the right hand side simply gathers all these unused inputs, with the
union there corresponding to legal (bilabelled) diagrams; see Figure 10.

We next effect the translation into bivariate generating functions, here taken to
be of the form (44). Thus, $z$ records gates and $\zeta$ records inputs. (Note the subtle
difference between $\zeta$, as a carrier of primed labels, and $u$, as a plain unlabelled
marker in earlier developments relative to (XD).) By the definition of $\mathcal{Q}$, we have

        $Q(z, \zeta) = \sum_{n,k=0}^{\infty} [k(k-1)]^n\, \frac{\zeta^k}{k!}\, \frac{z^n}{n!}$.

Equation (45) then translates as the biexponential generating function identity

        $e^{\zeta} \sum_{n,k=0}^{\infty} k!\, S_{n,k}\, \frac{\zeta^k}{k!}\, \frac{z^n}{n!} = \sum_{n,k=0}^{\infty} [k(k-1)]^n\, \frac{\zeta^k}{k!}\, \frac{z^n}{n!}$,

which solves to give explicitly (with ${n \brace k}_{2,2} = S_{n,k}$)

(46)    $\sum_{n,k=0}^{\infty} {n \brace k}_{2,2}\, \zeta^k\, \frac{z^n}{n!} = e^{-\zeta} \sum_{n,k=0}^{\infty} [k(k-1)]^n\, \frac{\zeta^k}{k!}\, \frac{z^n}{n!}$.

Collecting the coefficient of $z^n$ and expanding the result as a binomial convolution
leads to the following statement [!§].

Proposition 6. The normal ordering associated to $e^{z X^2 D^2}$ satisfies

(47)    $\mathfrak{N}\left(e^{z X^2 D^2}\right) = 1 + \sum_{n \ge 1} \frac{z^n}{n!} \sum_{k=2}^{2n} {n \brace k}_{2,2} X^k D^k$,

where the generalized Stirling numbers ${n \brace k}_{2,2}$ are

(48)    ${n \brace k}_{2,2} = \frac{1}{k!} \sum_{\ell=2}^{k} \binom{k}{\ell} (-1)^{k-\ell}\, [\ell(\ell-1)]^n$.

This calculation also implies a Dobiński-type formula for the generalized Bell
numbers of (42),

(49)    $\varpi_n^{2,2} = \frac{1}{e} \sum_{\ell \ge 2} \frac{[\ell(\ell-1)]^n}{\ell!}$,

an identity given in [19], which is further generalized in Proposition 7 below. The
formulae (48) and (49) are seen neatly to extend their standard counterparts¹⁵
in (38) and (37). Finally, expanding by the binomial theorem the quantities
$[\ell(\ell-1)]^n$ in (48) or in (49) serves to relate the generalized and basic families
of numbers:

        ${n \brace k}_{2,2} = \sum_{r=0}^{n} \binom{n}{r} (-1)^{n-r} {n+r \brace k}$,    $\varpi_n^{2,2} = \sum_{r=0}^{n} \binom{n}{r} (-1)^{n-r}\, \varpi_{n+r}$.

15 A similar, simplified, treatment can also be inflicted to the usual Stirling numbers. Let $\mathcal{W}$
be the bilabelled class of all finite words over the labelled alphabet $\{1', 2', \ldots\}$. The biexponential
generating function is $W(z, \zeta) = \sum_{k,n} k^n (\zeta^k/k!)(z^n/n!) = \exp(\zeta e^z)$. With $\mathcal{U}$ representing a stock
of "unused" letters, we have the identity $\mathcal{W} = \mathcal{U} \star \mathcal{R}$, where $\mathcal{R}$ is the class of "gapless" words, in
the sense that their letters form an initial segment of $\{1', 2', \ldots\}$. (The class $\mathcal{R}$ is isomorphic to the
class of "surjections" [51, §II.3], also known as ordered set partitions or preferential arrangements.)
We then have $W(z, \zeta) = e^{\zeta} R(z, \zeta)$, which implies $R = e^{-\zeta} W$, giving back $R(z, \zeta) = \exp(\zeta(e^z - 1))$,
which is both a biexponential generating function of surjections and the usual bivariate exponential
generating function of set partitions; cf. Equation (36), with the substitution $\zeta \mapsto u$.

Note 11. Matrix enumeration and diagrams. Let $\mathcal{M}^{\pm}_{n,k}$ be the collection of matrices
with $n$ rows and $k$ columns, having all their entries in $\{0, \pm 1\}$, such that the following
conditions are met: (i) there is exactly one $+1$-entry and one $-1$-entry in each row; (ii) no
column consists solely of 0s. The corresponding cardinality satisfies

        $M^{\pm}_{n,k} = k!\, {n \brace k}_{2,2}$,

as the set of matrices is clearly isomorphic to diagrams relative to $X^2 D^2$ with labelled
inputs (the column numbers): these constitute the class $\widehat{\mathcal{S}}_k$ above.

Similarly, let $\mathcal{M}_{n,k}$ be the collection of matrices with $n$ rows and $k$ columns, having all
their entries in $\{0, 1\}$, such that the following conditions are met: (i) there are exactly two
1-entries in each row; (ii) no column consists solely of 0s. The corresponding cardinality
satisfies

        $M_{n,k} = 2^{-n}\, k!\, {n \brace k}_{2,2}$,

since this is equivalent to killing the orientation of gates.

The counts of matrices without restrictions on the number of columns are then, respectively,

        $M^{\pm}_n = \sum_{k=2}^{2n} k!\, {n \brace k}_{2,2}$,    $M_n = 2^{-n} \sum_{k=2}^{2n} k!\, {n \brace k}_{2,2}$.

The sequence of values $(M_n)_{n \ge 1}$ starts as

        1, 13, 409, 23917, 2244361, 308682013, 58514835289.

It has number OEIS A055203, with a description, "The number of different relations
between n intervals on a line", which is easily related to our previous discussion. Also,
perhaps more picturesquely: "Imagine you have n events of non-zero duration, in how
many different ways could those events overlap in time?". (The OEIS refers to some
unpublished work of S. Schwer relative to temporal logics and formal linguistics.) ■
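
For very small sizes, the count of $\{0,1\}$-matrices with two 1-entries per row and no zero
column can be obtained by exhaustive search and compared with $2^{-n}\, k!\, {n \brace k}_{2,2}$. The sketch below is
a brute-force illustration of ours; the function names are hypothetical, and the search is
exponential in $n$, so it is only meant for $n \le 3$ or so.

```python
from itertools import combinations, product
from math import comb, factorial


def gen_stirling(n, k):
    """Generalized Stirling number: (1/k!) sum_l C(k,l) (-1)^(k-l) [l(l-1)]^n."""
    total = sum(comb(k, l) * (-1) ** (k - l) * (l * (l - 1)) ** n
                for l in range(k + 1))
    return total // factorial(k)


def count_matrices(n, k):
    """Number of n-by-k 0/1 matrices with two 1s per row and no all-zero column."""
    rows = list(combinations(range(k), 2))  # positions of the two 1-entries
    total = 0
    for choice in product(rows, repeat=n):
        covered = set()
        for pair in choice:
            covered.update(pair)
        if len(covered) == k:
            total += 1
    return total


def total_count(n):
    """Sum over all admissible numbers of columns, k = 2..2n."""
    return sum(count_matrices(n, k) for k in range(2, 2 * n + 1))


for n in range(1, 4):
    for k in range(2, 2 * n + 1):
        assert count_matrices(n, k) * 2 ** n == factorial(k) * gen_stirling(n, k)

print([total_count(n) for n in range(1, 4)])
```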



Note 12. Coupon collection with group drawings. In probability theory, the coupon
collector problem asks for the distribution of the number of samplings with replacement
from a finite set $\mathcal{E}$ that are needed until a complete collection of the elements of $\mathcal{E}$ is
obtained. If $T$ is this random time and $m$ is the cardinality of $\mathcal{E}$, one has

$$\mathbb{P}_m(T \le n) = \frac{m!}{m^n}\left\{ n \atop m \right\} = \sum_{k=0}^{m} \binom{m}{k} (-1)^{m-k} \left(\frac{k}{m}\right)^{n}, \tag{50}$$

since the event $\{T \le n\}$ corresponds to a sequence of $n$ choices (each with $m$ possibilities),
such that each of the $m$ possibilities is chosen at least once. (The $m!$ factor corresponds
to the fact that ordered set partitions are to be considered, since the $m$ elements of $\mathcal{E}$
are distinguishable.) See Feller's book [42] for a classical probabilistic derivation and [51,
pp. 116-117] for a treatment cast within the framework of analytic combinatorics. It is
also well known that the expected time for a complete collection satisfies

$$\mathbb{E}_m[T] = m\left(1 + \frac{1}{2} + \cdots + \frac{1}{m}\right) = m H_m = m\,(\log m + \gamma + o(1)).$$

The coupon collector problem with group drawings is the variant problem, where one
now draws from $\mathcal{E}$ in groups of $r$ distinct elements. Here $r = 2$. This problem is, once
more, a variant of those previously considered, and one finds

$$\mathbb{P}_m(T \le n) = \frac{m!}{(m(m-1))^{n}}\left\{ n \atop m \right\}_{2,2} = \sum_{k=0}^{m} \binom{m}{k} (-1)^{m-k} \left(\frac{k(k-1)}{m(m-1)}\right)^{n},$$

which exhibits a pleasant similarity to the basic case (50). This formula was obtained by
Stadje [118] by means of a subtle use of the inclusion-exclusion principle combined with
suitable combinatorial identities; the derivation above seems to us much more transparent.
Analytic methods then make it possible to derive the expected value [3S]

$$\mathbb{E}_m[T] = \frac{m(m-1)}{2m-1}\left(H_m + \frac{1}{2m-1} + \frac{(-1)^{m+1}}{(m+1)\binom{2m-1}{m+1}}\right) = \frac{m}{2}\,(\log m + \gamma + o(1)).$$

This analysis is itself intimately related to that of the number of isolated vertices under
the Erdős-Rényi random graph model [20, §7.1]. ■
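
The group-drawing formulae of Note 12 lend themselves to an exact check with rational arithmetic. The following sketch is an illustration under stated assumptions, not the authors' computation: each draw is taken to be a uniformly random pair of distinct coupons, the function names are hypothetical, and the expectation is obtained by summing $\mathbb{P}(T > n)$ as a finite combination of geometric series, then compared with a closed form in terms of $H_m$.

```python
from fractions import Fraction
from math import comb


def prob_complete(m, n):
    """P(T <= n) when each draw is a uniform pair of distinct coupons (m >= 2)."""
    return sum((-1) ** (m - k) * comb(m, k)
               * Fraction(k * (k - 1), m * (m - 1)) ** n
               for k in range(m + 1))


def expected_time_sum(m):
    """E[T] = sum_{n>=0} P(T > n), each k < m contributing a geometric series."""
    return sum(-(-1) ** (m - k) * comb(m, k)
               / (1 - Fraction(k * (k - 1), m * (m - 1)))
               for k in range(m))


def expected_time_closed(m):
    """Closed form in terms of the harmonic number H_m."""
    harmonic = sum(Fraction(1, i) for i in range(1, m + 1))
    tail = Fraction((-1) ** (m + 1), (m + 1) * comb(2 * m - 1, m + 1))
    return Fraction(m * (m - 1), 2 * m - 1) * (harmonic + Fraction(1, 2 * m - 1) + tail)


for m in range(2, 7):
    print(m, expected_time_sum(m), expected_time_closed(m))
```

For $m = 3$ both expressions give $5/2$, in agreement with the direct argument that the first pair leaves one coupon missing and each later pair contains it with probability $2/3$.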

Note 13. Set partitions and contiguities. Here is yet another interpretation of generalized
Stirling numbers. Given a partition $\pi$ of $\{1, \ldots, n\}$, a pair $(j, j+1)$ is called a contiguity
of $\pi$ if $j$ and $j + 1$ belong to the same block. By extension, we also say that the number $j$
is a contiguity.

In a diagram, we can view the gate bearing label $j$ as being associated to two inputs
labelled $2j - 1$ and $2j$. In this way, the set of labels becomes $\{1, \ldots, 2n\}$. We now
consider partitions of $\{1, \ldots, 2n\}$ and, given a gate, group in a single block all inputs that
correspond to the same thread (i.e., are "vertically aligned"). The rules corresponding to
the formation of diagrams then imply the following: diagrams formed from $X^2D^2$ gates
that are of size $n$ are in bijective correspondence with set partitions of size $2n$, where
contiguities of odd values $1, 3, 5, \ldots$ are forbidden. Thus the total number of such set
partitions is the generalized Bell number $\varpi_n^{2,2}$ of (49).

This raises the question of enumerating set partitions according to the number of
contiguities. The number $\widehat{B}_n$ of those without any contiguity turns out to be the shifted
(standard) Bell number $\varpi_{n-1}$, where $\varpi_n$ is defined in (37). This can be shown from the
relation

$$\widehat{\mathcal{B}}^{\square} \star \mathcal{U} = \mathcal{B}, \qquad \mathcal{U} = \mathrm{Set}(\mathcal{Z}),$$

which expresses the fact that an arbitrary partition (the class $\mathcal{B}$ of all set partitions) can be
obtained from a contiguity-free partition (the class $\widehat{\mathcal{B}}$) by gluing extra atoms (from $\mathcal{U}$) after
their immediate predecessors (hence the appearance of the box operator in $\widehat{\mathcal{B}}^{\square}$). From this
specification and the general translation rules seen earlier, a simple computation provides
the EGF: by (25), we have (with here $B_n = \varpi_n$, so that $B(z) = e^{e^z - 1}$)

$$\int \widehat{B}'(z)\,e^{z} = B(z), \quad\text{implying}\quad \widehat{B}(z) = \int B(z) = \sum_{n \ge 1} B_{n-1}\frac{z^n}{n!},$$

where the implication follows by differentiation, since $B'(z) = e^z B(z)$.
This solves the enumeration problem, as $\widehat{B}_n = B_{n-1}$. It is seen from here that

$$\int \widehat{B}'(z)\,e^{uz} = \int e^{e^{z} + uz - 1}$$

enumerates set partitions according to the number of contiguities, marked by $u$. (An analogous
process serves to relate derangements to the class of all permutations.) ■
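
The identity $\widehat{B}_n = B_{n-1}$ is easy to test by exhaustive enumeration. The sketch below is an illustration with hypothetical helper names: it encodes set partitions as restricted growth strings (the $i$-th entry is the block index of element $i$), counts those with no two consecutive elements in the same block, and compares with Bell numbers computed by the Bell triangle.

```python
def set_partitions(n):
    """Yield the set partitions of {1..n} as restricted growth strings."""
    def extend(prefix, blocks):
        if len(prefix) == n:
            yield tuple(prefix)
            return
        for b in range(blocks + 1):  # join an existing block or open a new one
            prefix.append(b)
            yield from extend(prefix, max(blocks, b + 1))
            prefix.pop()
    yield from extend([], 0)


def contiguities(rgs):
    """Number of j such that j and j+1 lie in the same block."""
    return sum(1 for a, b in zip(rgs, rgs[1:]) if a == b)


def bell(n):
    """Bell number B_n via the Bell triangle."""
    row = [1]
    for _ in range(n):
        new = [row[-1]]
        for x in row:
            new.append(new[-1] + x)
        row = new
    return row[0]


for n in range(1, 8):
    free = sum(1 for p in set_partitions(n) if contiguities(p) == 0)
    print(n, free, bell(n - 1))
```

The two columns printed for each $n$ coincide, as predicted by the box-product specification.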

In summary, the orbit of equivalences mentioned above (matrices, coupon collector,
set partitions) justifies considering the generalized Stirling numbers $\left\{ n \atop k \right\}_{2,2}$
as basic quantities of combinatorial analysis.

4.3. Higher order forms ($X^rD^r$). The discussion relative to $X^2D^2$ extends to
$X^rD^r$ and more generally to balanced polynomials, which are of the form

$$\mathfrak{h} = \sum_{\ell=1}^{r} \eta_\ell\, X^\ell D^\ell, \tag{51}$$

where $\eta = (\eta_\ell)$ is a sequence of arbitrary coefficients. We state:

Proposition 7 ( [TU1 [T5] [T3] ). Let $\mathfrak{h}$ be a balanced polynomial in the sense of (51).
The $\mathfrak{h}$-Stirling numbers defined as the coefficients in the expansion

$$\mathfrak{N}(\mathfrak{h}^n) = \sum_{k} \left\{ n \atop k \right\}_{\mathfrak{h}} X^k D^k$$

admit the explicit form

$$\left\{ n \atop k \right\}_{\mathfrak{h}} = \frac{1}{k!}\sum_{j=0}^{k} (-1)^{k-j}\binom{k}{j}\, h(j)^n, \qquad h(x) := \sum_{\ell=1}^{r} \eta_\ell\, x(x-1)\cdots(x-\ell+1). \tag{52}$$

Proof. A combinatorial proof based on gates and graphs proceeds along the same
line as what has been done for $X^2D^2$. See the diagram of (43) and Figure 10. What
happens is that each rung of a scaffolding is now comprised of an $\ell$-arrangement
(for some $\ell$ satisfying $1 \le \ell \le r$) of labelled inputs: the number of possibilities
for each rung to be connected to $j$ inputs is then exactly $h(j)$. Then, with $\mathcal{S}_k$ the
collection of scaffoldings with $k$ labelled inputs and $\mathcal{Q}$ the relaxed scaffoldings that
may have unused labelled inputs, the relation (45) still holds (see also Figure 10).
We then have the equality of double exponential generating functions

$$\sum_{n,k} \left\{ n \atop k \right\}_{\mathfrak{h}} x^k \frac{z^n}{n!} = e^{-x} \sum_{n,k} h(k)^n\, \frac{x^k}{k!}\frac{z^n}{n!}. \tag{53}$$

This last relation is equivalent to the statement upon extracting the coefficient
of $x^k z^n$. □
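
Formula (52) can be compared with a brute-force normal ordering. The sketch below is only an illustration: the dictionary representation `{(a, b): c}` for $c\,X^aD^b$ and all function names are assumptions made here, and the rewriting rule used is the Leibniz identity $D^bX^c = \sum_i \binom{b}{i}\,c(c-1)\cdots(c-i+1)\,X^{c-i}D^{b-i}$.

```python
from math import comb, factorial


def falling(x, i):
    """Falling factorial x(x-1)...(x-i+1)."""
    out = 1
    for t in range(i):
        out *= x - t
    return out


def multiply(p, q):
    """Product of normally ordered polynomials {(a, b): c} ~ c * X^a D^b."""
    out = {}
    for (a, b), c1 in p.items():
        for (c, d), c2 in q.items():
            # Move D^b past X^c with the Leibniz rule.
            for i in range(min(b, c) + 1):
                key = (a + c - i, b + d - i)
                out[key] = out.get(key, 0) + c1 * c2 * comb(b, i) * falling(c, i)
    return out


def normal_order_power(h, n):
    """Normal form of h^n."""
    result = {(0, 0): 1}
    for _ in range(n):
        result = multiply(result, h)
    return result


def h_stirling(eta, n, k):
    """Formula (52); eta[l] is the (integer) coefficient of X^l D^l."""
    def h(x):
        return sum(c * falling(x, l) for l, c in eta.items())
    total = sum((-1) ** (k - j) * comb(k, j) * h(j) ** n for j in range(k + 1))
    assert total % factorial(k) == 0
    return total // factorial(k)


# (X^2 D^2)^2 = X^4 D^4 + 4 X^3 D^3 + 2 X^2 D^2
print(normal_order_power({(2, 2): 1}, 2))
print([h_stirling({2: 1}, 2, k) for k in (2, 3, 4)])
```

The same comparison works for any balanced polynomial with integer coefficients, for instance `eta = {1: 1, 2: 1}` for $XD + X^2D^2$.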

The note below provides a typical alternative derivation taken from [TUJ [T51 [TU]
and based on simple algebra; see also [114, §3].

Note 14. Algebraic reduction of $(X^rD^r)^n$. The idea is to apply $(X^rD^r)^n$ to the
exponential function $e^x$. We write $\left\{ n \atop k \right\}_{r,r} := \left\{ n \atop k \right\}_{\mathfrak{h}}$ for $\mathfrak{h} = X^rD^r$. On the one hand, we
have

$$(X^rD^r)^n e^x = \sum_{k} \left\{ n \atop k \right\}_{r,r} X^k D^k e^x = e^x \sum_{k} \left\{ n \atop k \right\}_{r,r} x^k, \tag{54}$$

by the definition of the $\left\{ n \atop k \right\}_{r,r}$ numbers and the fact that $D^k e^x = e^x$. On the other hand,

$$(X^rD^r)^n e^x = \sum_{j} \bigl[j(j-1)\cdots(j-r+1)\bigr]^n \frac{x^j}{j!}, \tag{55}$$

by the Taylor expansion of $e^x$. The comparison of (54) and (55) yields

$$\sum_{k} \left\{ n \atop k \right\}_{r,r} x^k = e^{-x}\sum_{j} \bigl[j(j-1)\cdots(j-r+1)\bigr]^n \frac{x^j}{j!},$$

which is equivalent to the statement of Proposition 7 in the case $\mathfrak{h} = X^rD^r$. ■

Schork reviews the numbers of Note 14 in [114] and discusses as well some $q$-analogues
(see also the early work of Katriel and Kibler [52], the papers by Mendez
and Rodriguez [57] HE], and Section 8.2 below). The normal form problem for
$(X^rD^s)^n$ unites the case of $(X^rD^r)^n$ discussed above and the case of $(X^rD)^n$,
which is closely related to trees: we shall accordingly discuss it later, in Note 18 of
Subsection 6.2, p. 38. Relative to these and other cases, we note that Mikhailov [100]
obtained the normal orderings of

$$(X + D)^n, \quad (D^r + X)^n, \quad (D + N)^n, \quad (D^2 + N)^n,$$

where $N = XD$; his end-formulae are "combinatorial", but his derivations are
essentially algebraic. See also Katriel [75] for related material.

5. Quadratic forms ($X^2 + XD + D^2$), zigzags, and permutations

The normal forms associated with $(X^2 + D^2)$ are related to evolution equations
of the form

$$\frac{\partial}{\partial t}\psi(x, t) = \frac{\partial^2}{\partial x^2}\psi(x, t) + x^2\,\psi(x, t)$$

(in the case of the Schrödinger equation, this corresponds to a quantum harmonic
oscillator, i.e., a quadratic potential) and are of great importance in quantum optics,
where "squeezing" is introduced by means of quadratic forms in annihilation
and creation operators. It is then not a surprise that the normal form problem for
(powers of) quadratic forms should have been studied early, and, for instance, a
paper by Mehta [5S] published in 1977 contains results equivalent to our Propositions
8 and 9 below; see also Wilcox [133, §10] (published in 1967). The derivations
in the cited paper are entirely operator-algebraic. As we show here, such normal
forms follow routinely from the representation by diagrams in conjunction with
standard methods of combinatorial analysis. This approach, which, to the best of
our knowledge, is new, furthermore reveals connections with alternating permutations
(Subsection 5.1) as well as with general permutations classified according to
local order patterns (Subsection 5.2).



5.1. The circle form ($X^2 + D^2$) and zigzags. For the operator $(X^2 + D^2)$, the
associated gates have either two outgoing ($X^2$) or two ingoing lines ($D^2$); these are
then shaped like a "cup" or a "cap".

A general graph relative to $(X^2 + D^2)$ consists of (weakly) connected components,
and each connected component involves caps and cups in alternation: we shall refer
to them as "zigzags"; see Figure 11. Note that since the inputs and outputs of gates
are distinguishable, edges may cross. It is then easily realized that zigzags can be
of one of four types, depending on the excess of the number of free outputs over the
number of free inputs (values for this excess can be $-2$, $0$, $+2$) with, in addition,
for excess 0, the fact that diagrams may be either "closed" or "open". This gives
rise to four different types of connected components, which we denote by $\mathcal{A}, \mathcal{B}, \mathcal{C}, \mathcal{D}$.
In summary, with $\mathcal{G}$ being the class of all graphs relative to $(X^2 + D^2)$, $u$ marking
the number of outputs, and $v$ marking the number of inputs, the decomposition
into connected components is expressed by

$$\mathcal{G} = \mathrm{Set}\left(u^2 \mathcal{A} + v^2 \mathcal{B} + uv\, \mathcal{C} + \mathcal{D}\right). \tag{56}$$

FIGURE 11. The four classes of connected components (zigzags)
$\mathcal{A}, \mathcal{B}, \mathcal{C}, \mathcal{D}$ relative to $(X^2 + D^2)$.

Removal of the vertex with the largest label (in the case of $\mathcal{A}, \mathcal{C}, \mathcal{D}$) or with the
smallest one (in the case of $\mathcal{B}$) shows that a zigzag can be decomposed into one or
two smaller zigzags. This is expressed by the formal specification

$$\begin{aligned}
\mathcal{A} &= \mathcal{Z} + 4\,(\mathcal{A} \star \mathcal{Z}^{\blacksquare} \star \mathcal{A}), \\
\mathcal{B} &= \mathcal{Z} + 4\,(\mathcal{B} \star \mathcal{Z}^{\square} \star \mathcal{B}), \\
\mathcal{C} &= 4\,(\mathcal{A} \star \mathcal{Z}^{\blacksquare} \star (\mathcal{C} + \mathcal{E})), \\
\mathcal{D} &= 2\,(\mathcal{Z}^{\blacksquare} \star \mathcal{A}).
\end{aligned} \tag{57}$$

The multiplicities (2 and 4, as the case may be) reflect the various possibilities
of attachments: for instance, in the case of type $\mathcal{D}$, there are clearly two ways of
attaching the largest label to a previously existing zigzag of type $\mathcal{A}$. (The class $\mathcal{E}$
is comprised of a unique "neutral" element of size 0; the class $\mathcal{Z}$ consists of a single
atom of size 1.)

The decomposition into connected components (56) gives

$$G(z; u, v) = \exp\left(u^2 A(z) + v^2 B(z) + uv\, C(z) + D(z)\right), \tag{58}$$

while the combinatorial specification (57) gives rise to integral equations:

$$\begin{aligned}
A(z) &= z + 4\int_0^z A(t)^2\, dt, & B(z) &= z + 4\int_0^z B(t)^2\, dt, \\
C(z) &= 4\int_0^z A(t)\,(C(t) + 1)\, dt, & D(z) &= 2\int_0^z A(t)\, dt.
\end{aligned} \tag{59}$$

The equivalent differential equations,

$$\begin{aligned}
\partial_z A(z) &= 1 + 4A(z)^2, & \partial_z B(z) &= 1 + 4B(z)^2, \\
\partial_z C(z) &= 4A(z)\,(C(z) + 1), & \partial_z D(z) &= 2A(z)
\end{aligned}$$

(with the initial conditions $A(0) = B(0) = C(0) = D(0) = 0$) admit separation of
variables, hence they have closed-form solutions:

$$A(z) = \tfrac{1}{2}\tan(2z), \quad B(z) = \tfrac{1}{2}\tan(2z), \quad C(z) = \cos(2z)^{-1} - 1, \quad D(z) = -\tfrac{1}{2}\ln(\cos(2z)). \tag{60}$$

(Note that equality of generating functions $A(z) = B(z)$ mirrors the isomorphism
between classes $\mathcal{A}$ and $\mathcal{B}$, which can be seen explicitly by a horizontal reflection of
zigzags and a relabelling of the vertices according to $1, \ldots, m \mapsto m, \ldots, 1$.)

Collecting the results of (58) and (60), we obtain

Proposition 8. The generating function of all graphs associated with $(X^2 + D^2)$
admits the explicit form

$$G(z; u, v) = \frac{1}{\sqrt{\cos(2z)}} \cdot \exp\left(\tfrac{1}{2}(u^2 + v^2)\tan(2z) + uv\,(\sec(2z) - 1)\right). \tag{61}$$
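
The integral equations for the four zigzag classes can be solved coefficient by coefficient, which gives an independent check of the count of all graphs $n!\,[z^n]\,G(z;1,1)$ against the sum of the coefficients in the normal form of $(X^2+D^2)^n$. The sketch below is illustrative: the truncation order `N`, the list-of-`Fraction` series representation, and every function name are choices made here, not part of the original derivation.

```python
from fractions import Fraction
from math import comb, factorial

N = 7  # truncation order (an arbitrary choice for this illustration)


def mul(a, b):
    out = [Fraction(0)] * N
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            if i + j < N:
                out[i + j] += x * y
    return out


def integrate(a):
    return [Fraction(0)] + [a[i] / (i + 1) for i in range(N - 1)]


def add(a, b):
    return [x + y for x, y in zip(a, b)]


def scale(c, a):
    return [c * x for x in a]


def exp_series(a):
    """exp of a series with zero constant term, using E' = a' E."""
    e = [Fraction(1)] + [Fraction(0)] * (N - 1)
    for n in range(1, N):
        e[n] = sum(k * a[k] * e[n - k] for k in range(1, n + 1)) / n
    return e


ONE = [Fraction(1)] + [Fraction(0)] * (N - 1)
A = C = [Fraction(0)] * N
for _ in range(N):  # Picard iteration: each pass fixes at least one more coefficient
    A = integrate(add(ONE, scale(4, mul(A, A))))
    C = integrate(scale(4, mul(A, add(C, ONE))))
D = integrate(scale(2, A))
G = exp_series(add(add(scale(2, A), C), D))  # u = v = 1, and B = A
graph_counts = [int(G[n] * factorial(n)) for n in range(N)]


def total_coefficients(n):
    """Sum of the coefficients of the normal form of (X^2 + D^2)^n."""
    p = {(0, 0): 1}
    for _ in range(n):
        q = {}
        for (a, b), c in p.items():
            q[(a, b + 2)] = q.get((a, b + 2), 0) + c  # right factor D^2
            fall = 1
            for i in range(min(b, 2) + 1):            # right factor X^2
                key = (a + 2 - i, b - i)
                q[key] = q.get(key, 0) + c * comb(b, i) * fall
                fall *= 2 - i
        p = q
    return sum(p.values())


print(graph_counts)
print([total_coefficients(n) for n in range(N)])
```

The first values are 1, 2, 10, 60; for instance $(X^2+D^2)^2 = X^4 + 2X^2D^2 + 4XD + 2 + D^4$ has coefficient sum 10.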

Note 15. Zigzags and alternating permutations. The enumeration of zigzags is closely
related to one of the most classical problems of combinatorial analysis; namely, the enumeration
of alternating permutations; see [25, pp. 258-259] and [122, pp. 73-75]. (We follow
the presentation in [51, §II.6.3].) Let a permutation $\sigma = \sigma_1 \cdots \sigma_n$ be written as a word (so
that $\sigma_i$ represents $\sigma(i)$). The permutation $\sigma$ is said to be alternating if $\sigma_1 < \sigma_2$, $\sigma_2 > \sigma_3$,
and so on, with an alternating pattern of rises ($\sigma_{2j-1} < \sigma_{2j}$) and falls ($\sigma_{2j} > \sigma_{2j+1}$).
Let $\mathcal{S}$ and $\mathcal{T}$ be, respectively, the classes of even-sized and odd-sized alternating permutations,
with EGFs $S$ and $T$. By a maximum based decomposition analogous to what we have seen before,
the specifications are

$$\mathcal{T} = \mathcal{Z} + (\mathcal{T} \star \mathcal{Z}^{\blacksquare} \star \mathcal{T}); \qquad \mathcal{S} = \{\epsilon\} + (\mathcal{T} \star \mathcal{Z}^{\blacksquare} \star \mathcal{S}).$$

Hence, via integral and differential relations, the well-known solutions (see footnote 16)

$$S(z) = \sec(z) = 1 + 1\frac{z^2}{2!} + 5\frac{z^4}{4!} + 61\frac{z^6}{6!} + \cdots \quad (\text{OEIS A000364}),$$

$$T(z) = \tan(z) = z + 2\frac{z^3}{3!} + 16\frac{z^5}{5!} + 272\frac{z^7}{7!} + \cdots \quad (\text{OEIS A000182}).$$

Also of interest are alternating cycles defined to be directed cycles whose edges have
alternatively increasing and decreasing end points. These are necessarily of even size with
specification $\mathcal{U} = (\mathcal{Z}^{\blacksquare} \star \mathcal{T})$, so that $U(z) = \int T(z)$ and $U(z) = \log\sec(z)$, whose coefficients
are a shifted version of the tangent numbers $n!\,[z^n]\tan(z)$. The undirected cycle version
has EGF $V(z) = \tfrac{1}{2}\log\sec(z) + \tfrac{1}{4}z^2$, so that (undirected) graphs whose components are
(undirected) alternating cycles have EGF

$$\frac{e^{z^2/4}}{\sqrt{\cos(z)}} = 1 + 1\frac{z^2}{2!} + 4\frac{z^4}{4!} + 38\frac{z^6}{6!} + 710\frac{z^8}{8!} + \cdots,$$

which is not found in the OEIS. For reasons discussed below, in Section 7, several of these
numbers have OGFs that admit explicit continued fraction representations. ■
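
The secant and tangent numbers of Note 15 can be cross-checked by listing alternating permutations directly. The sketch below is illustrative, with hypothetical function names; it uses the classical Seidel (boustrophedon) triangle as an independent way of generating the coefficients of $\sec z + \tan z$.

```python
from itertools import permutations


def is_alternating(p):
    """True if p[0] < p[1] > p[2] < ... (rises and falls alternate)."""
    return all((p[i] < p[i + 1]) == (i % 2 == 0) for i in range(len(p) - 1))


def count_alternating(n):
    """Brute-force count of alternating permutations of size n."""
    return sum(1 for p in permutations(range(1, n + 1)) if is_alternating(p))


def zigzag_numbers(count):
    """First `count` values of n! [z^n] (sec z + tan z), via the Seidel triangle."""
    row, out = [1], []
    for _ in range(count):
        out.append(row[-1])
        new = [0]
        for x in reversed(row):
            new.append(new[-1] + x)
        row = new
    return out


print([count_alternating(n) for n in range(8)])
print(zigzag_numbers(8))
```

Even indices give the secant numbers 1, 1, 5, 61 and odd indices the tangent numbers 1, 2, 16, 272.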

5.2. The general quadratic form ($\alpha D^2 + \beta X^2 + \gamma XD$). We treat here diagrams
corresponding to the general quadratic form

$$\alpha D^2 + \beta X^2 + \gamma XD,$$

which introduce extra gates having one ingoing and one outgoing line (compare
with Subsection 5.1).

[Gates and their weights: $\alpha D^2$ (cap), $\beta X^2$ (cup), $\gamma XD$ (single line).]

Hence, the graphs again have the structure of open and closed zigzags, but now
with ascents and descents of arbitrary length; see Fig. 12. Additionally, parameters
$\alpha$, $\beta$ and $\gamma$ keep track, respectively, of the statistics of peaks, valleys and rises/falls
in the construction. The analysis of this case closely follows the scheme given in
Subsection 5.1. We state:

16 We note, following the referee's remark, that the link between calculation of vacuum
expectation values of powers of the quadratic form $X^2 + D^2$ with alternating permutations
and secant numbers has been also observed by C.V. Sukumar and A. Hodges in [66, 123].

FIGURE 12. The four classes of connected components $\mathcal{A}, \mathcal{B}, \mathcal{C}, \mathcal{D}$ associated
with $(\alpha D^2 + \beta X^2 + \gamma XD)$.



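Proposition 9. The generating function of all graphs associated with $(\alpha D^2 + \beta X^2 + \gamma XD)$
admits the explicit form

$$G(z; u, v) = \sqrt{\frac{\cos(\theta)}{\cos(\delta z + \theta)}}\; e^{-\gamma z/2}\, \exp\left(\left(\frac{u^2}{4\alpha} + \frac{v^2}{4\beta}\right)\bigl(\delta\tan(\delta z + \theta) - \gamma\bigr)\right) \exp\left(uv\left[\frac{\cos(\theta)}{\cos(\delta z + \theta)} - 1\right]\right), \tag{62}$$

where

$$\delta = \sqrt{4\alpha\beta - \gamma^2}, \qquad \theta = \arctan(\gamma/\delta). \tag{63}$$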



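Proof. The decomposition into connected components gives, for the class of all
diagrams:

$$\mathcal{G} = \mathrm{Set}\left(u^2 \mathcal{A} + v^2 \mathcal{B} + uv\, \mathcal{C} + \mathcal{D}\right). \tag{64}$$

The specification of the four types of components parallels that of (57):

$$\begin{aligned}
\mathcal{A} &= \beta\, \mathcal{Z} + 2\gamma\, \mathcal{Z}^{\blacksquare} \star \mathcal{A} + 4\alpha\, \mathcal{A} \star \mathcal{Z}^{\blacksquare} \star \mathcal{A}, \\
\mathcal{B} &= \alpha\, \mathcal{Z} + 2\gamma\, \mathcal{Z}^{\square} \star \mathcal{B} + 4\beta\, \mathcal{B} \star \mathcal{Z}^{\square} \star \mathcal{B}, \\
\mathcal{C} &= \gamma\, \mathcal{Z} + \gamma\, \mathcal{Z}^{\blacksquare} \star \mathcal{C} + 4\alpha\, \mathcal{A} \star \mathcal{Z}^{\blacksquare} \star (\mathcal{C} + \mathcal{E}), \\
\mathcal{D} &= 2\alpha\, \mathcal{Z}^{\blacksquare} \star \mathcal{A}.
\end{aligned}$$

The translation into equations binding EGFs is now automatic. First, we have

$$G(z; u, v) = \exp\left(u^2 A(z) + v^2 B(z) + uv\, C(z) + D(z)\right).$$

Next,

$$\begin{aligned}
A(z) &= \beta z + 2\gamma \int_0^z A(t)\, dt + 4\alpha \int_0^z A(t)^2\, dt, \\
B(z) &= \alpha z + 2\gamma \int_0^z B(t)\, dt + 4\beta \int_0^z B(t)^2\, dt, \\
C(z) &= \gamma z + \gamma \int_0^z C(t)\, dt + 4\alpha \int_0^z A(t)\,(C(t) + 1)\, dt, \\
D(z) &= 2\alpha \int_0^z A(t)\, dt.
\end{aligned}$$

The corresponding differential equations are

$$\begin{aligned}
\partial_z A(z) &= \beta + 2\gamma A(z) + 4\alpha A(z)^2, \\
\partial_z B(z) &= \alpha + 2\gamma B(z) + 4\beta B(z)^2, \\
\partial_z C(z) &= (\gamma + 4\alpha A(z)) \cdot (C(z) + 1), \\
\partial_z D(z) &= 2\alpha A(z),
\end{aligned}$$

with initial conditions $A(0) = B(0) = C(0) = D(0) = 0$. There is again separation
of variables. However, as exemplified by the case of $A$, one now needs to solve (with
$y = A(z)$)

$$\frac{dy}{\beta + 2\gamma y + 4\alpha y^2} = dz,$$

which necessitates factoring the quadratic form in the denominator, hence solving
a quadratic equation: this introduces the auxiliary quantities (63). The solutions
found are

$$\begin{aligned}
A(z) &= \frac{\delta}{4\alpha}\,\frac{\gamma + \delta\tan(\delta z)}{\delta - \gamma\tan(\delta z)} - \frac{\gamma}{4\alpha} = \frac{\delta}{4\alpha}\tan(\delta z + \theta) - \frac{\gamma}{4\alpha}, \\
B(z) &= \frac{\delta}{4\beta}\,\frac{\gamma + \delta\tan(\delta z)}{\delta - \gamma\tan(\delta z)} - \frac{\gamma}{4\beta} = \frac{\delta}{4\beta}\tan(\delta z + \theta) - \frac{\gamma}{4\beta}, \\
C(z) &= \frac{1}{\cos(\delta z) - \frac{\gamma}{\delta}\sin(\delta z)} - 1 = \frac{\cos(\theta)}{\cos(\delta z + \theta)} - 1, \\
D(z) &= -\frac{1}{2}\ln\left(\cos(\delta z) - \frac{\gamma}{\delta}\sin(\delta z)\right) - \frac{\gamma}{2} z = \frac{1}{2}\ln\left(\frac{\cos(\theta)}{\cos(\delta z + \theta)}\right) - \frac{\gamma}{2} z,
\end{aligned}$$

from which the statement results. □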
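
A quick numerical sanity check of the closed forms is possible without any symbolic algebra. The sketch below is an illustration under assumptions: it takes real parameters with $4\alpha\beta > \gamma^2$ (so that $\delta$ is real), uses hypothetical function names, and verifies by central differences that $A(z) = \frac{\delta}{4\alpha}\tan(\delta z+\theta) - \frac{\gamma}{4\alpha}$ and its companions satisfy the four differential equations of the proof.

```python
import math


def solutions(alpha, beta, gamma):
    """Closed forms for A, B, C, D; assumes 4*alpha*beta > gamma**2."""
    delta = math.sqrt(4 * alpha * beta - gamma ** 2)
    theta = math.atan(gamma / delta)

    def A(z):
        return (delta * math.tan(delta * z + theta) - gamma) / (4 * alpha)

    def B(z):
        return (delta * math.tan(delta * z + theta) - gamma) / (4 * beta)

    def C(z):
        return math.cos(theta) / math.cos(delta * z + theta) - 1

    def D(z):
        ratio = math.cos(theta) / math.cos(delta * z + theta)
        return 0.5 * math.log(ratio) - gamma * z / 2

    return A, B, C, D


def derivative(f, z, step=1e-6):
    """Central finite difference."""
    return (f(z + step) - f(z - step)) / (2 * step)


def residuals(alpha, beta, gamma, z):
    """Left side minus right side of each differential equation at z."""
    A, B, C, D = solutions(alpha, beta, gamma)
    return [
        derivative(A, z) - (beta + 2 * gamma * A(z) + 4 * alpha * A(z) ** 2),
        derivative(B, z) - (alpha + 2 * gamma * B(z) + 4 * beta * B(z) ** 2),
        derivative(C, z) - (gamma + 4 * alpha * A(z)) * (C(z) + 1),
        derivative(D, z) - 2 * alpha * A(z),
    ]


print(residuals(1.5, 0.7, 0.4, 0.1))
```

The residuals are at the level of the finite-difference error, and all four functions vanish at $z = 0$, as required by the initial conditions.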



Note 16. Generalized Eulerian numbers. In the same way that zigzags are closely related
to alternating permutations, the diagrams considered here are closely related to a quadrivariate
statistics on permutations; namely that of the number of peaks, valleys, double
rises, and double falls [59, Ex. 3.3.46, p. 195]. For instance, Carlitz and Scoville [23] found
the corresponding multivariate EGF,

$$\frac{e^{\alpha_2 z} - e^{\alpha_1 z}}{\alpha_2 e^{\alpha_1 z} - \alpha_1 e^{\alpha_2 z}}, \qquad \alpha_1\alpha_2 = u_1 u_2, \quad \alpha_1 + \alpha_2 = u_3 + u_4$$

($u_1, u_2, u_3, u_4$ mark, respectively, the four types of elements listed above), which suitably
generalizes the bivariate EGF of Eulerian numbers (OEIS A008292 and [25, p. 244]),

$$A(z, u) = \frac{1 - u}{1 - u\,e^{z(1-u)}}, \tag{65}$$

which enumerates permutations according to the number of rises (do $u_1 = u_3 = u$, $u_2 =
u_4 = 1$). See [51, p. 202] for a derivation along the lines above. The corresponding
OGFs also have explicit continued fraction expansions [45], which are closely related to
the combinatorial approach of the present study; see Subsection 7.1. ■

FIGURE 13. The basic gate Y associated with $X^2D$ (left) and a corresponding
diagram (right box).

6. Semilinear forms ($\phi(X)D$) and increasing trees

We consider here first-order differential operators of the general form $\phi(X)D +
\rho(X)$, where $\phi$ is a (nonlinear) polynomial of arbitrary degree. The general combinatorial
model is that of increasing trees, which were discussed early by Leroux and
Viennot [86, 87] and further developed by Bergeron, Labelle, and Leroux in their
reference text [3, Ch. 5]. (The subject is treated under the combinatorial-analytic
angle by Bergeron et al. [6].) In the physics literature, Lang [84] seems to have been
among the first to discuss the normal ordering associated with $X^rD$; see also his
later paper [85]. Then Blasiak et al. [H] [TB] discovered the relationship between
general abstract varieties of increasing trees, as in [6], and the forms ($\phi(X)D$),
which we propose to examine now. It is pleasant to note that the developments
of this section also serve to answer several algebraic-combinatorial questions first
raised by Scherk in 1823; see the discussion relative to Figure 27, p. 77 of the
Appendix.

6.1. The form ($X^2D$) and increasing trees. In the case of the operator $X^2D$,
the gates have one input and two outputs. They are thus shaped as a letter "Y";
see Figure 13. By successively grafting outputs of such Y's to inputs of other Y's,
we obtain graphs that are necessarily acyclic, so that each connected component
is a tree. These trees are plane trees (there is a distinction between left and right
outgoing edges); they are binary, by design; and, finally, they are increasing in the
sense that labels increase along any branch stemming from the root. Such trees are
quite well known in combinatorial theory under the name of binary increasing trees,
their importance being due to the fact that they are in bijective correspondence
with permutations. See for instance [51, §II.6.3] or [119].

We commence with the class $\mathcal{T}$ of binary increasing trees. A tree in $\mathcal{T}$ is obtained
by starting with a root that bears the smallest label; then, each of the two output
edges is either left untouched or it is attached recursively to a similar tree. Within
the language of specifications summarized in Section 2, this is expressed as follows
("$\mathcal{E}$" represents a neutral structure of size 0):

$$\mathcal{T} = \mathcal{Z}^{\square} \star (\mathcal{E} + \mathcal{T}) \star (\mathcal{E} + \mathcal{T}).$$

The translation to EGFs is then

$$T(z) = \int_0^z (1 + T(w))^2\, dw,$$

which, by differentiation, leads to the equation

$$T'(z) = (1 + T(z))^2, \qquad T(0) = 0.$$

Since this last differential equation admits separation of variables, we find $T'/(1 +
T)^2 = 1$, hence $-1/(1 + T) = z - 1$; that is,

$$T(z) = \frac{z}{1 - z} = \sum_{n \ge 1} n!\, \frac{z^n}{n!}.$$

This verifies that $T_n = n!$: binary increasing trees are equinumerous with permutations.
(Here are two easy combinatorial arguments: (i) the
number of choices for successively adding a new gate is 1, 2, 3, 4, 5, ...; (ii) a permutation
is obtained when node labels are read in infix, i.e., left-to-right, order.
The formal relation with binary trees is well explained combinatorially by Leroux-Viennot
[86, 87]; see also Ex. 12, p. 383.)

Now, a connected diagram has one input (the link into the root) and $(n + 1)$ free
outputs, if $n$ is the number of nodes (Y-gates associated with ($X^2D$)) of the tree.
Thus, the trivariate EGF, where $u$ marks outputs and $v$ marks inputs, is

$$v\,u\,T(uz) = \frac{v u^2 z}{1 - uz}.$$

The EGF of all diagrams is accordingly

$$G(z; u, v) = \exp\left(\frac{v u^2 z}{1 - uz}\right).$$

In particular, the total number of diagrams of size $n \ge 1$ is

$$n!\,[z^n]\,G(z; 1, 1) = n!\,[z^n]\, e^{z/(1-z)} = \sum_{k=1}^{n} \binom{n-1}{k-1} \frac{n!}{k!}.$$

The sequence starts as 1, 1, 3, 13, 73, 501, 4051, 37633, and is OEIS A000262 ("sets
of lists"); see also [51, p. 125] ("fragmented permutations"). We are indeed enumerating
unordered forests of increasing binary trees, equivalently, sets of nonempty
permutations, i.e., structures of type $\mathrm{Set}(\mathrm{Seq}_{\ge 1}(\mathcal{Z}))$.

This approach applies almost verbatim to the operator $X^rD$, with $r \ge 2$. In this
case, we are dealing with $r$-ary trees, where each node has $r$ outgoing edges.

The differential equation becomes $T' = (1 + T)^r$, with solution

$$T = \bigl[1 - (r-1)z\bigr]^{-1/(r-1)} - 1, \quad\text{and}\quad n!\,[z^n]\,T(z) = n!\,(r-1)^n \binom{n + 1/(r-1) - 1}{n}. \tag{66}$$

The balance between the number of outputs and size is dealt with as before (with
an $r$-ary tree of $n$ internal nodes having $n(r - 1) + 1$ outgoing edges). So:

Proposition 10. The EGF of diagrams associated with ($X^rD$) is

$$G(z; u, v) = \exp\left(v\left[\frac{u}{\bigl(1 - (r-1)u^{r-1}z\bigr)^{1/(r-1)}} - u\right]\right).$$

For $r = 2$, the normal forms associated with $\mathfrak{h} = (X^2D)$ simplify to

$$\mathfrak{N}(\mathfrak{h}^n) = \sum_{k=1}^{n} \binom{n-1}{k-1} \frac{n!}{k!}\, X^{n+k} D^k.$$

For $r \ge 3$, one has $\mathfrak{N}(\mathfrak{h}^n) = \sum_{k} \left\{ n \atop k \right\}_{r,1} X^{k + (r-1)n} D^k$, with

$$\left\{ n \atop k \right\}_{r,1} = \frac{n!}{k!}\,(r-1)^n \sum_{\ell=0}^{k} (-1)^{k-\ell} \binom{k}{\ell} \binom{n + \ell/(r-1) - 1}{n}.$$

Combinatorially, the number of possibilities for successively adding a new gate is
now $1, r, (2r - 1), \ldots$, which is consistent with (66). The derivation given above, via
EGFs, has however been adopted since it is the only one applicable to the general
case ($\phi(X)D$), as we see next. The total number of graphs for $r = 3, 4$ starts as

r = 3: 1, 4, 25, 211, 2236, 28471, 422899, 7173580, ... OEIS A049118
r = 4: 1, 5, 41, 465, 6721, 117941, 2433145, 57673281, ... OEIS A049119,

and these coefficients are expressible as binomial sums; see Lang's study [84] for
an early discussion of these numbers revisited in [85] and the papers [12, 16] by
Blasiak et al. for relations with the normal ordering problem.
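
The coefficient formula for $(X^rD)^n$ can be tested against a direct normal ordering. The sketch below is illustrative: the generalized binomial coefficient with a rational upper argument, the dictionary representation `{(a, b): c}` for $c\,X^aD^b$, and all names are assumptions made for this example. The binomial-sum expression is the one obtained by expanding $\exp\bigl(vu\,[(1-(r-1)u^{r-1}z)^{-1/(r-1)} - 1]\bigr)$ in powers of $v$.

```python
from fractions import Fraction
from math import comb, factorial


def gen_binomial(x, n):
    """Generalized binomial coefficient C(x, n) for rational x."""
    out = Fraction(1)
    for i in range(n):
        out *= x - i
    return out / factorial(n)


def coefficient(r, n, k):
    """Coefficient of X^(k+(r-1)n) D^k in the normal form of (X^r D)^n, r >= 2."""
    s = sum((-1) ** (k - l) * comb(k, l)
            * gen_binomial(n + Fraction(l, r - 1) - 1, n)
            for l in range(k + 1))
    return Fraction(factorial(n), factorial(k)) * (r - 1) ** n * s


def normal_form(r, n):
    """Brute-force normal ordering of (X^r D)^n as {(a, b): c} ~ c X^a D^b."""
    p = {(0, 0): 1}
    for _ in range(n):
        q = {}
        for (a, b), c in p.items():
            # X^a D^b * X^r D = sum_i C(b,i) r(r-1)..(r-i+1) X^(a+r-i) D^(b+1-i)
            fall = 1
            for i in range(min(b, r) + 1):
                key = (a + r - i, b + 1 - i)
                q[key] = q.get(key, 0) + c * comb(b, i) * fall
                fall *= r - i
        p = q
    return p


for r in (2, 3):
    print(r, [sum(normal_form(r, n).values()) for n in range(1, 5)])
```

For $r = 2$ the totals are 1, 3, 13, 73 (sets of lists) and for $r = 3$ they are 1, 4, 25, 211.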

6.2. The general case ($\phi(X)D$). In this case, we must consider a polynomial

$$\phi(y) = \sum_{r=0}^{R} \phi_r\, y^r,$$

with gates of the form $X^rD$, any such gate being assigned weight $\phi_r$. The corresponding
increasing trees can thus have various types of internal nodes, as dictated
by $\phi(y)$. Graphically:

[Gates, one type per monomial of $\phi$: $XD$, $X^2D$, $X^3D$, ...]

Footnotes to Proposition 10. Interestingly enough, Comtet has a form equivalent to the one
we give, as an exercise in his book [25, Ex. 2, p. 220] relative to "Lie derivatives and
operational calculus". The coefficients, up to sign, are sometimes known as "Lah numbers"
(OEIS A008297 and [111, p. 44]), the Lah polynomials being essentially Laguerre polynomials.

FIGURE 14. A diagram associated with some $\phi(X)D$ that is comprised of two
rooted trees.

In combinatorial terms, this defines a variety of increasing trees, the basic and
asymptotic theory of these being the subject of the paper [6].

The graphs associated with $\phi(X)D$ are still (unordered) forests of increasing
trees, themselves denoted by $\mathcal{T}$, or $\mathcal{T}^{\phi}$, whenever the dependency on $\phi$ needs to be
made explicit. The basic equation for the EGF $T = T(z, u)$, with $u$ a marker for
"unsaturated" outgoing edges, is then (Figure 14)

$$\mathcal{T} = \sum_{r=0}^{R} \phi_r \Bigl( \mathcal{Z}^{\square} \star \underbrace{(u + \mathcal{T}) \star \cdots \star (u + \mathcal{T})}_{r \text{ times}} \Bigr). \tag{67}$$

Thus, we have

$$T(z, u) = \int_0^z \phi\bigl(u + T(w, u)\bigr)\, dw,$$

implying

$$\frac{\partial}{\partial z} T(z, u) = \phi\bigl(u + T(z, u)\bigr), \qquad T(0, u) = 0,$$

and separation of variables yields, with $u$ a parameter:

$$\int_0^{T(z,u)} \frac{dt}{\phi(u + t)} = z.$$

We can now conclude as follows.

Proposition 11. Define

$$\Phi(y) := \int_{y_0}^{y} \frac{d\eta}{\phi(\eta)}, \tag{68}$$

where $y_0$ is an arbitrary nonnegative constant chosen so that $\phi(y_0) \ne 0$. Let $T =
T^{\phi}(z, u)$ be defined as the solution of

$$T:\quad \Phi(T + u) - \Phi(u) = z, \quad\text{with } T(0, u) = 0. \tag{69}$$

Then the multivariate EGF of all diagrams relative to ($\phi(X)D$) is

$$G(z; u, v) = \exp\bigl(v\,T^{\phi}(z, u)\bigr).$$






Proposition 11 provides a solution to the general problem posed by Scherk in
his doctoral dissertation [113, §8] of 1823: see our Appendix A, p. 77. The remarks
relative to Scherk's thesis, p. 76 below, provide a concrete illustration of the
combinatorics of increasing trees in relation to the normal form of $(\phi(X)D)^n$.

On another register, it is shown in [6] that the singular structure of $T(z, 1)$ can be
systematically determined, from which there follow many asymptotic distributional
analyses of parameters, including path length, node-degree profile, root degree, and
so on. However, as regards explicit forms that are of concern for us here, only a
few functions $\phi(y)$ are susceptible to exact expressions; for instance,

$$y^r, \quad (1 + y)^r, \quad (ay^2 + by + c), \quad y(y + 1) \cdots (y + r - 1). \tag{70}$$

Note 17. Some solvable varieties [HI §2]. First, ($X^2D + D$), corresponding to $\phi(y) = y^2 + 1$,
leads to a solvable case:

$$G(z; u, v) = \exp\left(\frac{v\,(1 + u^2)\tan z}{1 - u\tan z}\right), \qquad G(z; 1, 1) = \exp\left(\frac{2\tan(z)}{1 - \tan z}\right).$$

The coefficients $n!\,[z^n]\,G(z; 1, 1)$ are of the form $2^n a_n$, where

$(a_n)$ = 1, 1, 2, 6, 23, 107, 583, 3633, 25444, 197620, ...

is OEIS A000772 ["the number of elevated(!) increasing binary trees"]. More generally,
any quadratic polynomial leads to a solvable model.

On another register, if $\phi(y)$ has only rational roots, then its partial fraction expansion
only involves rational coefficients. The inversion problem is then of the type

$$\sum_{j} r_j \log(1 - \alpha_j T) = z.$$

Thus, by inversion, $T$ is an algebraic function of $e^z$. This is in particular the case for
Stirling polynomials, such as $\phi(y) = (y + 1)(y + 2)(y + 3)$, for which

$$T(z, 1) = -2 + \frac{2}{\sqrt{4 - 3e^{2z}}} = 6z + 66\frac{z^2}{2!} + 1158\frac{z^3}{3!} + 28290\frac{z^4}{4!} + \cdots.$$

(This last example is from [6, p. 30].)

Other interesting cases for combinatorics are $\phi(y) = e^y$ and $\phi(y) = (1 - y)^{-1}$, leading,
respectively, to increasing Cayley trees (so-called "recursive" trees) and increasing
Catalan trees ("plane ordered recursive trees", also known as "PORTs"); see [3, §1]. The
corresponding EGFs, $T(z) = T(z, 1)$, are

$$T(z) = \log\frac{1}{1 - z} = \sum_{n \ge 1} (n-1)!\,\frac{z^n}{n!} \qquad (\phi(y) = e^y),$$

$$T(z) = 1 - \sqrt{1 - 2z} = \sum_{n \ge 1} \bigl(1 \cdot 3 \cdots (2n-3)\bigr)\frac{z^n}{n!} \qquad (\phi(y) = (1 - y)^{-1}).$$

We will not discuss them further as they are out of our scope, since they concern non-polynomial
forms (see however Scherk's result relative to the normal ordering of ($e^xD$),
Proposition A.2 in the Appendix). ■
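
The first solvable case, $\phi(y) = 1 + y^2$, can be reproduced by solving $T' = \phi(u + T)$ as a power series at $u = 1$ and exponentiating. The sketch below is an illustration: the truncation order `N`, the coefficient list `PHI`, and the helper names are assumptions made here. It compares $n!\,[z^n]\exp(T(z,1))$ with the sum of the coefficients in the brute-force normal form of $(X^2D + D)^n$.

```python
from fractions import Fraction
from math import comb, factorial

N = 7            # truncation order (arbitrary choice for this sketch)
PHI = [1, 0, 1]  # phi(y) = 1 + y^2, i.e. the operator D + X^2 D


def mul(a, b):
    out = [Fraction(0)] * N
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            if i + j < N:
                out[i + j] += x * y
    return out


def tree_series(phi, u=1):
    """Picard iteration for T' = phi(u + T), T(0) = 0."""
    T = [Fraction(0)] * N
    for _ in range(N):
        base = [Fraction(u)] + T[1:]
        power = [Fraction(1)] + [Fraction(0)] * (N - 1)
        rhs = [Fraction(0)] * N
        for coeff in phi:
            rhs = [x + coeff * y for x, y in zip(rhs, power)]
            power = mul(power, base)
        T = [Fraction(0)] + [rhs[i] / (i + 1) for i in range(N - 1)]
    return T


def exp_series(a):
    """exp of a series with zero constant term, using E' = a' E."""
    e = [Fraction(1)] + [Fraction(0)] * (N - 1)
    for n in range(1, N):
        e[n] = sum(k * a[k] * e[n - k] for k in range(1, n + 1)) / n
    return e


def total_by_normal_ordering(phi, n):
    """Sum of coefficients of the normal form of (sum_r phi_r X^r D)^n."""
    p = {(0, 0): 1}
    for _ in range(n):
        q = {}
        for (a, b), c in p.items():
            for r, w in enumerate(phi):
                fall = 1
                for i in range(min(b, r) + 1):
                    key = (a + r - i, b + 1 - i)
                    q[key] = q.get(key, 0) + c * w * comb(b, i) * fall
                    fall *= r - i
        p = q
    return sum(p.values())


G = exp_series(tree_series(PHI))
counts = [int(G[n] * factorial(n)) for n in range(N)]
print(counts)
print([total_by_normal_ordering(PHI, n) for n in range(N)])
```

The first values 1, 2, 8, 48 are $2^n a_n$ with $a_n = 1, 1, 2, 6$.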



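Note 18. Algebraic reduction of $X^rD^s$ after [101119]. One may apply the same procedure
as in Note 14, p. 28, to obtain coefficients of the normal form of $(X^rD^s)^n$ written as

$$\mathfrak{N}\bigl((X^rD^s)^n\bigr) = X^{n(r-s)} \sum_{k} \left\{ n \atop k \right\}_{r,s} X^k D^k.$$

FIGURE 15. A particular diagram (forest) associated to a form
($\phi(X)D + \rho(X)$), comprised of one rooted tree and two planted trees.

We first assume that $r \ge s$. The action of $(X^rD^s)^n$ on the exponential $e^x$ is

$$(X^rD^s)^n e^x = x^{n(r-s)} \sum_{k} \left\{ n \atop k \right\}_{r,s} x^k e^x; \tag{71}$$

equivalently,

$$(X^rD^s)^n e^x = \sum_{j} \left[\prod_{p=1}^{n} \bigl(j + (p-1)(r-s)\bigr)^{\underline{s}}\right] \frac{x^{j + n(r-s)}}{j!}, \tag{72}$$

where $y^{\underline{s}} = y(y-1)\cdots(y-s+1)$ denotes the falling factorial.
Then the comparison of (71) and (72) produces

$$\sum_{k} \left\{ n \atop k \right\}_{r,s} x^k = e^{-x} \sum_{j} \left[\prod_{p=1}^{n} \bigl(j + (p-1)(r-s)\bigr)^{\underline{s}}\right] \frac{x^{j}}{j!},$$

and extraction of coefficients yields

$$\left\{ n \atop k \right\}_{r,s} = \frac{1}{k!} \sum_{j=s}^{k} (-1)^{k-j} \binom{k}{j} \prod_{p=1}^{n} \bigl(j + (p-1)(r-s)\bigr)^{\underline{s}}, \tag{73}$$

a formula, of which, remarkably enough, Scherk had non-trivial cases (see Proposition A.3,
p. 77). The case $r < s$ gives rise to similar coefficients, as results from the duality argument
of Note 2, p. 11. ■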
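
Formula (73) lends itself to the same kind of mechanical check as the balanced case. The sketch below is illustrative, with hypothetical function names and the dictionary representation `{(a, b): c}` for $c\,X^aD^b$; it assumes $r \ge s \ge 1$ and compares the alternating sum with the coefficient of $X^{k+n(r-s)}D^k$ obtained by brute-force normal ordering.

```python
from math import comb, factorial


def falling(x, s):
    """Falling factorial x(x-1)...(x-s+1)."""
    out = 1
    for t in range(s):
        out *= x - t
    return out


def stirling_rs(r, s, n, k):
    """Formula (73), for r >= s."""
    total = 0
    for j in range(s, k + 1):
        prod = 1
        for p in range(1, n + 1):
            prod *= falling(j + (p - 1) * (r - s), s)
        total += (-1) ** (k - j) * comb(k, j) * prod
    assert total % factorial(k) == 0
    return total // factorial(k)


def normal_form(r, s, n):
    """Brute-force normal ordering of (X^r D^s)^n."""
    p = {(0, 0): 1}
    for _ in range(n):
        q = {}
        for (a, b), c in p.items():
            # X^a D^b * X^r D^s = sum_i C(b,i) r(r-1)..(r-i+1) X^(a+r-i) D^(b-i+s)
            for i in range(min(b, r) + 1):
                key = (a + r - i, b - i + s)
                q[key] = q.get(key, 0) + c * comb(b, i) * falling(r, i)
        p = q
    return p


# (X^2 D)^2 = X^4 D^2 + 2 X^3 D = X^2 (X^2 D^2 + 2 X D)
print(normal_form(2, 1, 2))
print([stirling_rs(2, 1, 2, k) for k in (1, 2)])
```

Taking $r = s$ recovers the numbers of Note 14, so the check also covers the balanced monomial case.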



6.3. The form ($\phi(X)D + \rho(X)$) and planted trees. We now have gates of the
form $X^s$ (arising from the monomials contained in $\rho(X)$) in addition to the earlier
ones, of form $X^rD$, that originate from $\phi(X)D$. In combinatorial terms, there is
thus the additional possibility of "planting" any combination of members of the
family of increasing trees on a root, whose outdegree has possibilities dictated
by the monomials of $\rho(X)$. Graphically:

[Gates arising from $\rho(X)$: $1$, $X$, $X^2$, $X^3$, ...]

A connected component is then either a planted tree of $\mathcal{R}^{\phi}$ or a rooted tree of $\mathcal{T}^{\phi}$.
The corresponding specification is (with $\mathcal{T}^{\phi}$ as before):

$$\begin{aligned}
\mathcal{R}^{\phi} &= \sum_{s \ge 0} \rho_s \left( \mathcal{Z}^{\square} \star (u + \mathcal{T}^{\phi})^{s} \right) && \text{(planted trees)} \\
\mathcal{G} &= \mathrm{Set}\left(\mathcal{R}^{\phi} + v\,\mathcal{T}^{\phi}\right) && \text{(all graphs)}.
\end{aligned}$$

We then have:

Proposition 12. The graphs associated with the normal ordering of ($\phi(X)D +
\rho(X)$) have the trivariate generating function

$$G(z; u, v) = e^{R^{\phi}(z, u)} \cdot e^{v\,T^{\phi}(z, u)}, \quad\text{where}\quad R^{\phi}(z, u) := \int_0^z \rho\bigl(u + T^{\phi}(t, u)\bigr)\, dt,$$

and $T^{\phi}$ is specified by (68) and (69).



Note 19. Yet another PDE. In the case of the PDE

$$\frac{\partial}{\partial z} F(x, z) = \phi(x)\,\frac{\partial}{\partial x} F(x, z), \tag{74}$$

with initial value condition $F(x, 0) = f(x)$, the solution (cf. Proposition 11) can be described
in the following suggestive way.

Let $Q(y)$ be the primitive function of $1/\phi(y)$ and $Q^{(-1)}(x)$ denote its inverse; the
solution of (74) is

$$F(x, z) = f\bigl(Q^{(-1)}(z + Q(x))\bigr).$$

This is, for instance, the form given by Dattoli et al. in [30, Equation (1.2.18)].
Equation (1.2.25) of that paper treats the general case of Proposition 12 above. See also
the discussion in [16], especially the formula (1) there. We shall briefly return to the
combinatorics of increasing trees, when we discuss multivariate extensions of these results
in Section 9, Note 25, p. 62 below. ■



7. Binomial forms ($X^a + D^b$), lattice path models, and continued fractions

The purpose of this section is to develop from first principles combinatorial
models that are now expressible as lattice paths. The approach is an alternative to
the gates-and-diagrams model of previous sections, though it is strongly related:
we shall indeed present a correspondence in Subsection 8.1. The main focus here
is on the reduction of powers of the "binomial forms" ($X^a + D^b$). Contrary to
what has been the case until now, the reductions are often far from being explicit,
in terms of either coefficients or generating functions. Nonetheless, this section
reveals interesting connections with other areas of combinatorics. In this context,
the "Fermat forms" ($X^r + D^r$) stand out, due to a simple relation with a yet
mysterious class of continued fractions.

7.1. Normal ordering and lattice paths. We first revisit briefly the normal
ordering problem. Given the (not necessarily normal) representation $H$ of an operator,
which may be a power $\mathfrak{h}^n$ or a generating function $e^{z\mathfrak{h}}$, its normal ordering
is, by definition, of the form

$$\mathfrak{N}(H) = \sum_{\alpha, \beta \in \mathbb{Z}_{\ge 0}} c_{\alpha,\beta}\, X^{\alpha} D^{\beta}$$

for a family of constants $c_{\alpha,\beta} = c_{\alpha,\beta}(H)$ that depends on $H$. Of special interest for
our subsequent discussion is the constant term $c_{0,0}$, which we shall rewrite as

$$\mathrm{C.T.}(H) = c_{0,0}(H).$$

Note that, since $D^{\beta}$, for $\beta \ge 1$, annihilates any constant, we have

$$H \circ \mathbf{1} = \sum_{\alpha} c_{\alpha,0}\, x^{\alpha},$$

where '$H \circ f$' represents the application of the operator $H$ to the function $f = f(x)$
and $\mathbf{1}$ denotes the constant function equal to unity. We then have the obvious
constant-term identity

$$\mathrm{C.T.}(H) = H \circ \mathbf{1}\,\big|_{x=0}. \tag{75}$$

In this section, we shall mostly be concerned with constant term identities.

There is a known way to represent the normal ordering process as a transformation
of lattice paths in the cartesian plane $\mathbb{Z}_{\ge 0} \times \mathbb{Z}_{\ge 0}$. This is for instance reviewed
in the elegant discussion of Varvak [127]; see also Subsection 8.1 below for a quick
review. Our approach has some analogy, but it also differs in essential aspects.

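First, we note that the linear differential operator $D$ is characterized by the way
it acts on the canonical basis $\{x^k\}_{k \ge 0}$ of monomials, in which it is represented
by the infinite matrix

$$\mathbf{D} = \begin{pmatrix} \cdot & 1 & \cdot & \cdot & \cdot & \cdots \\ \cdot & \cdot & 2 & \cdot & \cdot & \cdots \\ \cdot & \cdot & \cdot & 3 & \cdot & \cdots \\ \cdot & \cdot & \cdot & \cdot & 4 & \cdots \\ \vdots & & & & & \ddots \end{pmatrix}, \qquad \mathbf{D}_{i,j} = j\,[\![\, i = j-1 \,]\!], \quad i,j \ge 0.$$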



Similarly, the linear multiplication operator is represented by the matrix 



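$$\mathbf{X} = \begin{pmatrix} \cdot & \cdot & \cdot & \cdots \\ 1 & \cdot & \cdot & \cdots \\ \cdot & 1 & \cdot & \cdots \\ \vdots & & \ddots & \end{pmatrix}, \qquad \mathbf{X}_{i,j} = [\![\, i = j+1 \,]\!], \quad i,j \ge 0.$$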



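If $\mathbf{H}$ is the matrix obtained from an $\{X, D\}$-operator $\mathfrak{H}$ by the substitutions $X \mapsto \mathbf{X}$
and $D \mapsto \mathbf{D}$, then the constant term of $\mathfrak{H}$ is obtained as

(76) $$\mathrm{C.T.}(\mathfrak{H}) = (1,0,0,\ldots)\,\mathbf{H}\,(1,0,0,\ldots)^{T},$$

that is, the constant term of $\mathfrak{H}$ equals the upper left corner element of $\mathbf{H}$.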
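Identity (76) can be checked numerically on finite truncations of the two matrices. The following Python sketch is only an illustration: the helper names (`shift_matrices`, `mat_mul`, `constant_term_matrix`) are ours, and the truncation size `len(word) + 1` is an assumption that suffices because a word of length n never reaches a monomial of degree larger than n.

```python
def shift_matrices(size):
    """Truncations of the matrices of X and D on the basis 1, x, x^2, ..."""
    # Column j holds the image of x^j: X x^j = x^(j+1), D x^j = j x^(j-1).
    X = [[1 if i == j + 1 else 0 for j in range(size)] for i in range(size)]
    D = [[j if i == j - 1 else 0 for j in range(size)] for i in range(size)]
    return X, D


def mat_mul(a, b):
    """Plain product of two square integer matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def constant_term_matrix(word):
    """Upper left corner of the matrix product associated with a word in X, D."""
    size = len(word) + 1
    X, D = shift_matrices(size)
    H = [[1 if i == j else 0 for j in range(size)] for i in range(size)]
    for letter in word:
        H = mat_mul(H, X if letter == "X" else D)
    return H[0][0]
```

For instance, `constant_term_matrix("DDXX")` evaluates the corner entry of the product of the four truncated matrices taken in the order D, D, X, X.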

We can now avail ourselves of the basic isomorphism between matrix products 
and paths in graphs (see, e.g., p. 9] or [51, §V.5.1]). We consider here digraphs
(directed graphs), whose vertices are the integers $\mathbb{Z}_{\ge 0}$. The graphs also have edges
that are allowed to bear multiplicities, with the multiplicity of a path being the 
product of the multiplicities of the edges that it comprises. Then, the transposed 



Given a predicate $P$, we denote by $[\![P]\!]$ its indicator, whose value is 1 if $P$ is true and
0 otherwise (Iverson's notation).

Figure 16. The Weyl graph corresponding to (X + D). 




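matrix $\mathbf{X}^{T}$ is the incidence matrix of the following graph (edge weights are written
above the arrows)

$$\mathbf{X}^{T}: \quad 0 \xrightarrow{\;1\;} 1 \xrightarrow{\;1\;} 2 \xrightarrow{\;1\;} 3 \xrightarrow{\;1\;} \cdots$$

while the transposed $\mathbf{D}^{T}$ corresponds to

$$\mathbf{D}^{T}: \quad 0 \xleftarrow{\;1\;} 1 \xleftarrow{\;2\;} 2 \xleftarrow{\;3\;} 3 \xleftarrow{\;4\;} \cdots$$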

The analogy with the way $X$ and $D$ operate is striking: just interpret state $k$ as
representing the quantity $x^k$.

Of special interest is the graph similarly constructed from the matrix $(\mathbf{X} + \mathbf{D})^{T}$
associated to the operator $(X + D)$; it is displayed in Figure 16 and we propose
to call it the Weyl graph. To each monomial $\mathfrak{f} = \mathfrak{f}_1 \cdots \mathfrak{f}_n$ in $X, D$, we associate a
path $\pi(\mathfrak{f})$ obtained by scanning $\mathfrak{f}$ backwards and transcribing each letter as either
a leftward move (for a $D$) or a rightward move (for an $X$):

(77) $$\pi(\mathfrak{f}) = \pi_1 \cdots \pi_n, \quad \text{where} \quad \pi_j = \begin{cases} \text{a leftward move } (D) & \text{if } \mathfrak{f}_{n+1-j} = D, \\ \text{a rightward move } (X) & \text{if } \mathfrak{f}_{n+1-j} = X. \end{cases}$$



Multiplicities along a path are to be cumulated multiplicatively, as said. In addition, 
a $D$-move from vertex 0 is simply to be interpreted as carrying a weight 0 (since
$D \cdot 1 = 0$), which must then make the weight of the whole path vanish; to take
care of this case, it is convenient to add a "sink node" (tagged by $-1$ in Figure 16)
to our graph. With these conventions, we can state:

Proposition 13. Consider a non-commutative monomial $\mathfrak{f}$ in $X, D$. The constant
term of its normal form is nonzero if and only if the associated path $\pi(\mathfrak{f})$ in the
Weyl graph of Figure 16, starting from vertex 0, returns to vertex 0. In that case,
this constant term is equal to the multiplicative weight of the path $\pi(\mathfrak{f})$ as described
in (77).



Proof. The constant term is given by the matrix form of (76). Here the matrix $\mathbf{H}$ is
the one corresponding to the product $\mathfrak{f}_1 \cdots \mathfrak{f}_n$ of the $X$s and $D$s that $\mathfrak{f}$ is composed
of. This constant term also equals the upper left corner of the transposed matrix
$\mathbf{H}^{T} = \pi_1 \cdots \pi_n$ (see Equation (77)), where each move $\pi_j$ is read as the corresponding
transposed matrix $\mathbf{X}^{T}$ or $\mathbf{D}^{T}$. Then, the classical isomorphism between matrix
products and graphs yields the statement. □



For instance, $\mathfrak{f} = XXDD$ has $\mathrm{C.T.}(\mathfrak{f}) = 0$, which corresponds to the fact that, in
the reversed form $DDXX$, the first $D$ takes us to the sink state. For $\mathfrak{g} = DDXX$,
we have $\mathrm{C.T.}(\mathfrak{g}) = 2$, since, to the reverse form $XXDD$, there corresponds a path

$$0 \xrightarrow{\;1\;} 1 \xrightarrow{\;1\;} 2 \xrightarrow{\;2\;} 1 \xrightarrow{\;1\;} 0,$$

with multiplicity $1 \times 1 \times 2 \times 1 = 2$. By design, this agrees with the interpretation
of state $k$ as a token for the monomial $x^k$:

$$\mathrm{C.T.}(DDXX) = (DDXX) \circ 1 = 2, \quad \text{since} \quad 1 \xrightarrow{\;X\;} x \xrightarrow{\;X\;} x^2 \xrightarrow{\;D\;} 2x \xrightarrow{\;D\;} 2.$$

This interpretation also gives back the basic property that the constant term of a
monomial $\mathfrak{f} = \mathfrak{f}_1 \cdots \mathfrak{f}_n$ is nonzero iff each suffix $\mathfrak{f}_j \cdots \mathfrak{f}_n$ has at least as many $X$s as
$D$s and the number of $X$s in $\mathfrak{f}$ equals the number of $D$s.

Finally, a path in the graph $\mathbb{Z}_{\ge 0}$, with edges of the form $(j, j+1)$ and $(j, j-1)$,
is classically interpreted as a lattice path of Dyck type [122, p. 221], that is, a
polygonal line in the cartesian plane $\mathbb{Z} \times \mathbb{Z}$: start from the origin and simply
associate a North-East move ("ascent") ($\nearrow$) to a rightward step and a South-East
move ("descent") ($\searrow$) to a leftward step. The multiplicity of such a path is the
product of the (starting) altitudes of descents. For instance, to $\mathfrak{h} = DXDDXDXX$
there corresponds, by reversion, the path $\pi = XXDXDDXD$ in the Weyl graph,
which gives rise to the Dyck representation with step sequence

X X D X D D X D

The multiplicity in this case is $\mathrm{C.T.}(\mathfrak{h}) = 1 \times 1 \times 2 \times 1 \times 2 \times 1 \times 1 \times 1 = 4$.
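The path rule of Proposition 13 is easy to mechanize. The following Python sketch (the function name `constant_term` is ours, not the paper's) scans a word over {X, D} from the right, tracks the current vertex of the Weyl graph, and multiplies the weights of the leftward moves. It is meant as an illustration of the conventions above, not as a reference implementation.

```python
def constant_term(word: str) -> int:
    """Constant term of the normal form of a monomial in X and D.

    The word is read backwards (operators act from the right); state k
    stands for the monomial x^k, as in the Weyl graph.
    """
    state, weight = 0, 1
    for letter in reversed(word):
        if letter == "X":
            state += 1              # rightward move, weight 1
        elif letter == "D":
            if state == 0:          # D applied to a constant: sink node
                return 0
            weight *= state         # leftward move from state k has weight k
            state -= 1
        else:
            raise ValueError("expected a word over the letters X and D")
    # The path must return to vertex 0 for the constant term to be nonzero.
    return weight if state == 0 else 0
```

Applied to the three words discussed in the text, XXDD, DDXX and DXDDXDXX, the function follows exactly the paths described there.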

7.2. Fermat forms $(X^r + D^r)$ and continued fractions. Motzkin paths$^{20}$ are
lattice paths that, in addition to ascents ($\nearrow$) and descents ($\searrow$), are also allowed to
contain level steps ($\rightarrow$). Flajolet [45] has built elements of a combinatorial theory
of continued fractions, which can be viewed as based on the following generating
function of Motzkin paths:

(78) $$F(\mathbf{a}, \mathbf{d}, \boldsymbol{\ell}) = \cfrac{1}{1 - \ell_0 - \cfrac{a_0 d_1}{1 - \ell_1 - \cfrac{a_1 d_2}{1 - \ell_2 - \cfrac{a_2 d_3}{\ddots}}}}$$

(See also [51, §V.4] for a concise exposition.) Here the variables $a_j$, $d_j$, and $\ell_j$ mark,
respectively, the ascents, descents, and level steps, with (starting) altitude equal
to $j$. A substitution

(79) $$a_j \mapsto \alpha_j z, \qquad d_j \mapsto \delta_j z, \qquad \ell_j \mapsto \lambda_j z,$$

then yields the ordinary generating function of Motzkin paths when multiplicities
$(\alpha_j, \delta_j, \lambda_j)$ are present, with $z$ marking length. In this case, the continued fraction
of (78) becomes

(80) $$F(z) = \cfrac{1}{1 - \lambda_0 z - \cfrac{\alpha_0 \delta_1 z^2}{1 - \lambda_1 z - \cfrac{\alpha_1 \delta_2 z^2}{1 - \lambda_2 z - \cfrac{\alpha_2 \delta_3 z^2}{\ddots}}}}$$

which is known as a Jacobi fraction or J-fraction [105, 113].

20 Such paths are associated to an enriched Weyl graph in which self-loops (edges from a vertex to itself)
are permitted.

We note that the continued fractions in (78) and (80) are ordinary generating
functions, whereas we have been considering so far exponential generating functions
in association with $e^{z\mathfrak{h}}$. The connection is via the formal Laplace transform $\mathcal{L}$
defined as

$$\mathcal{L}\left[\sum_{n=0}^{\infty} f_n \frac{z^n}{n!}\right] = \sum_{n=0}^{\infty} f_n z^n,$$

which is (formally; sometimes asymptotically, or even analytically) representable
by

$$\mathcal{L}[\varphi(z)] = \int_0^{\infty} e^{-t}\,\varphi(tz)\,dt.$$

Thus, we shall obtain here constant term identities for the OGF

$$\frac{1}{1 - z\mathfrak{h}},$$

instead of the more customary $e^{z\mathfrak{h}}$.

As a first illustration, we revisit the normal ordering problem relative to $\mathfrak{h} = (X + D)$.
The constant term of $e^{z\mathfrak{h}}$ is in this case the EGF of "closed" diagrams;
i.e., diagrams with no free input or output that are relative to $X$-gates and $D$-gates.
For size $2n$, these are enumerated by the odd factorials, $1 \cdot 3 \cdots (2n-1)$,
with EGF equal to $e^{z^2/2}$, as we saw already. On the other hand, the interpretation
as lattice paths, with weights (79) of the form

(81) $$\alpha_j = 1, \qquad \delta_j = j, \qquad \lambda_j = 0,$$

leads to a continued fraction (80) that must correspond to the OGF of odd factorials.
We thus obtain:

Proposition 14. The normal ordering of $(X + D)$ corresponds to the continued
fraction expansion

$$\mathrm{C.T.}\,\frac{1}{1 - z(X+D)} = \mathrm{C.T.}\,\mathcal{L}\left[e^{z(X+D)}\right] = \sum_{n \ge 0} [1 \cdot 3 \cdots (2n-1)]\,z^{2n} = \cfrac{1}{1 - \cfrac{1 \cdot z^2}{1 - \cfrac{2 \cdot z^2}{1 - \cfrac{3 \cdot z^2}{\ddots}}}}$$

Analytically, this formal continued fraction is easily derived as a special case of
Gauss's expansion of the quotient of contiguous ${}_2F_1$ hypergeometric functions [105,
113]. Asymptotically, it is associated with the expansion at infinity of the Gaussian
error function. Combinatorially, its proof [45] reduces to a bijection originally due
to Françon and Viennot [55], itself based on a linear scan of the arch diagram
representation of involutions (see, e.g., [51, Ex. 5.10, p. 333]). The expansion of
Proposition 14 is finally tightly coupled with Hermite polynomials, hence the name
"Hermite histories" chosen by Viennot for Motzkin-Dyck paths weighted according
to (81).

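We next turn to the general normal ordering problem of the Fermat form $(X^r + D^r)$,
with $r$ a natural integer.

Proposition 15. The normal ordering of $(X^r + D^r)$ corresponds to the continued
fraction expansion (with $x^{\overline{r}} = x(x+1) \cdots (x+r-1)$)

$$\mathrm{C.T.}\,\frac{1}{1 - z(X^r + D^r)} = \mathrm{C.T.}\,\mathcal{L}\left[e^{z(X^r + D^r)}\right] = \cfrac{1}{1 - \cfrac{1^{\overline{r}} \cdot z^2}{1 - \cfrac{(r+1)^{\overline{r}} \cdot z^2}{1 - \cfrac{(2r+1)^{\overline{r}} \cdot z^2}{\ddots}}}}$$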

Proof. What is involved is the collection of all Dyck paths in the Weyl graph, such
that $X$s and $D$s go by groups of $r$ identical letters; for instance $X^3 X^3 D^3 X^3 D^3 D^3$,
for $r = 3$. Then, only vertices whose values are multiples of $r$ are reachable. By
grouping steps $r$ by $r$, these paths are seen to be equivalent to paths in the nearest-neighbor
graph with vertex set $\mathbb{Z}_{\ge 0}$, but with weights taken according to the rule
(with $x^{\underline{r}} = x(x-1) \cdots (x-r+1)$)

$$\alpha_j = 1, \qquad \delta_j = (jr)^{\underline{r}}, \qquad \lambda_j = 0.$$

An appeal to the continued fraction theorem (as summarized by (78)) applied to
these condensed paths completes the proof. □
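Proposition 15 can be tested on its first coefficients. The Python sketch below is a hedged illustration, with function names of our own choosing: `ct_by_action` lets $(X^r + D^r)^n$ act on the constant function 1 and reads off the value at $x = 0$, as in (75), while `ct_by_fraction` expands the truncated continued fraction with descent weights $\delta_j = (jr)^{\underline{r}}$ as a power series in $z^2$.

```python
def falling(x, r):
    """Falling factorial x (x-1) ... (x-r+1)."""
    out = 1
    for i in range(r):
        out *= x - i
    return out


def ct_by_action(r, n):
    """C.T. of (X^r + D^r)^n, obtained by applying the operator n times to 1."""
    poly = {0: 1}                       # {degree: coefficient}
    for _ in range(n):
        new = {}
        for k, c in poly.items():
            new[k + r] = new.get(k + r, 0) + c                       # X^r x^k
            if k >= r:
                new[k - r] = new.get(k - r, 0) + c * falling(k, r)   # D^r x^k
        poly = new
    return poly.get(0, 0)


def invert_series(a, terms):
    """First `terms` coefficients of 1/a(t), assuming a[0] == 1."""
    inv = [1] + [0] * (terms - 1)
    for n in range(1, terms):
        inv[n] = -sum(a[k] * inv[n - k] for k in range(1, n + 1))
    return inv


def ct_by_fraction(r, terms):
    """Coefficients of z^0, z^2, ..., z^(2(terms-1)) of the fraction of Proposition 15."""
    series = [1] + [0] * (terms - 1)    # innermost tail, truncated to 1
    for j in range(terms - 1, 0, -1):
        delta = falling(j * r, r)       # weight of a descent from condensed level j
        denom = [1] + [-delta * c for c in series[: terms - 1]]
        series = invert_series(denom, terms)
    return series
```

The truncation at depth `terms - 1` is exact for the coefficients returned, since the coefficient of $z^{2m}$ only involves $\delta_1, \ldots, \delta_m$. For $r = 1$ both functions return the odd factorials of Proposition 14.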



For $r = 2$, we can make use of the computations of Subsection 5.1 relative to the
normal ordering of $(X^2 + D^2)^n$, to derive the continued fraction identity:

(82) $$\mathcal{L}\left[\frac{1}{\sqrt{\cos(2z)}}\right] = \cfrac{1}{1 - \cfrac{1 \cdot 2 \cdot z^2}{1 - \cfrac{3 \cdot 4 \cdot z^2}{1 - \cfrac{5 \cdot 6 \cdot z^2}{\ddots}}}}$$

This fraction can otherwise be deduced from expansions due to Stieltjes and Rogers
and relative to $\mathcal{L}[\sec^{\theta} z]$; here, $\theta = \frac{1}{2}$. Rescaling $z$ to $z/\sqrt{2}$ leads to a continued
fraction for the OGF of the sequence

1, 1, 7, 139, 5473, 357721, 34988647, ...

which is OEIS A126156.

It is unclear whether explicit expressions can be distilled out of the expansion
of Proposition 15 when $r \ge 3$. The most intriguing question in this range is to
identify the special functions associated to the simplest case $r = 3$, namely,

(83) $$\Phi_3(z) := \cfrac{1}{1 - \cfrac{1 \cdot 2 \cdot 3 \cdot z^2}{1 - \cfrac{4 \cdot 5 \cdot 6 \cdot z^2}{1 - \cfrac{7 \cdot 8 \cdot 9 \cdot z^2}{\ddots}}}} = 1 + 6 z^2 + 756 z^4 + 458136 z^6 + 765341136 z^8 + \cdots.$$

Note 20. On cubic continued fractions. Only a few cubic analogues of $\Phi_3$ are known. One
group is related to the Dixonian elliptic functions [26, 27]: for instance, with a coefficient
law that alternates, depending on the parity of levels,

$$\int_0^{\infty} e^{-t}\,\mathrm{sm}(zt)\,dt = \cfrac{z}{1 + \cfrac{1 \cdot 2^2 \cdot z^3}{1 + \cfrac{3^2 \cdot 4 \cdot z^3}{\ddots}}},$$

where the elliptic function $\mathrm{sm}(z)$ is defined as inverse of an Abelian integral:

$$z = \int_0^{\mathrm{sm}(z)} \frac{dy}{(1 - y^3)^{2/3}}.$$

Another group, of Stieltjes-Rogers-Ramanujan-Apéry fame, is related to the Hurwitz zeta
function,

$$\zeta(s, x+1) = \sum_{k \ge 1} \frac{1}{(k+x)^s},$$

and it contains, for instance (see [H p. 153]): 
C(3,a:+1)= 



2x{x + l) + 



1 + 



3x(x + 1) + 



1 + 



Wx{x + 1)H 

which is somehow related to Apéry's proof [126] of the irrationality of $\zeta(3)$.



From the previous examples, it is easily realized that the general scheme giving
rise to explicit continued fraction expansions is when $\mathfrak{h}$ is of the form

$$X^r D^s + X^s D^r + \sum_j h_j X^j D^j.$$

The steps in the cartesian plane are now of the three vectorial types $\binom{r}{s}$, $\binom{s}{r}$,
and $\binom{j}{j}$, which can be collapsed by a linear change of coordinates to the three
types that serve to form Motzkin paths, hence continued fractions. The weights
are invariably a polynomial function of the altitude (i.e., the index $k$). In this way,
continued fractions with polynomial coefficients of all degrees can be constructed,
though both the special functions aspects (the existence of explicit forms) and the
combinatorics (bijections with simple "natural" combinatorial structures) remain
unclear at this level of generality.

Note 21. Horzela structures and $(XD^2 + X^2D)$. Imagine$^{21}$ a universe where particles
may be subject both to fission (a particle gives rise to two particles) and fusion (two
particles merge to give rise to a single particle). The diagrams are those associated with
$(XD^2 + X^2D)$, where gates are of type either Y or its horizontally flipped image. Thus,
the graphical representations are a complex network of trees and "inverted" trees. The
ordinary generating function $H(z)$ of the diagrams with one root (one input) and one
surviving particle (one output) is then

$$H(z) = \cfrac{1}{1 - \cfrac{1^2 \cdot 2 \cdot z^2}{1 - \cfrac{2^2 \cdot 3 \cdot z^2}{\ddots}}} = 1 + 2 z^2 + 28 z^4 + 1256 z^6 + 129904 z^8 + 25758368 z^{10} + \cdots.$$

Such $H$-structures are loosely evocative of cellular decompositions (combinatorial maps)
of surfaces of arbitrary genus, a subject of active research (see, e.g., [22, 24] and references
therein). ■

Whenever available, continued fractions are associated with a rich set of identities,
due most notably to their connection with orthogonal polynomials [15, 105,
113], and they potentially give rise to efficient computational procedures [81].

7.3. The general binomial case $(X^a + D^b)$. The general correspondence expressed
by Proposition 13 (for monomials) is still applicable to the normal ordering
of $(X^a + D^b)$. Thus, there exists a transcription in terms of paths in the Weyl
graph, where the allowed steps are either rightward moves of amplitude $a$ or leftward
moves of amplitude $b$; the multiplicity of a path, as before, is the product
of the starting altitudes of descents, each leftward move of amplitude $b$ being read
as $b$ successive unit descents.

However, when $a \ne b$, the connection with continued fractions is lost, as the
case is no longer reducible to the Dyck paradigm. The simplest instances are
$(X^3 + D^2)$ and $(X^4 + D^2)$ (or their duals, $(X^2 + D^3)$ and $(X^2 + D^4)$). These
formally correspond to the anharmonic quantum oscillator with a cubic or quartic 
potential. The extensive quest for explicit solutions in this context indicates the 
difficulty of finding connections with the most classical special functions. For the 
record, we tabulate here the following constant terms: 

(84) $$\begin{array}{ll} \mathrm{C.T.}\,[(D^2 + X^3)^n]\,\big|_{n=0,\ldots,10}: & 1, 0, 0, 0, 0, 864, 0, 0, 0, 0, 1157815296 \\ \mathrm{C.T.}\,[(D^2 + X^4)^n]\,\big|_{n=0,\ldots,10}: & 1, 0, 0, 24, 0, 0, 49536, 0, 0, 828002304, 0. \end{array}$$

These seem not to be related to existing sequences in the OEIS.

21 This case was suggested to us by Andrzej Horzela (private communication, 2010).

Note 22. Duchon's clubs. The lattice paths relative to $(X^3 + D^2)$, but when weights of
both leftward and rightward steps are set to 1, appear in the literature under the name of
"Duchon's numbers", which enumerate the combinatorial class of "Duchon's clubs" [4, 34].
The sequence of nonzero Duchon numbers $(\mathcal{S}_{5n})$,

1, 2, 23, 377, 7229, 151491, 3361598, 77635093, ...

is OEIS A060941. In the figurative description of [4, p. 53]:

"A club opens in the evening and closes in the morning. People arrive by
pairs and leave in threesomes. What is the possible number of scenarios
from dusk to dawn as seen from the club's entry?"

In this simplified situation (multiplicities of steps are disregarded), we are only considering
the possible evolutions in time of the club's population. The ordinary generating
function $\mathcal{S}(z)$ is then an algebraic function, here of degree 10,

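$$z\mathcal{S}^{10} + 5z\mathcal{S}^{9} + 5z\mathcal{S}^{8} - 10z\mathcal{S}^{7} - 15z\mathcal{S}^{6} + 11z\mathcal{S}^{5} + (15z - 1)\,\mathcal{S}^{4} + (1 - 10z)\,\mathcal{S}^{3} - 5z\mathcal{S}^{2} + 5z\mathcal{S} - z = 0,$$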
and one has the following explicit expression

$$\mathcal{S}_{5n} = \sum_{i=0}^{n} \frac{1}{5n+i+1} \binom{5n+1}{n-i} \binom{5n+2i}{i}.$$

It is interesting to note that the case of $(D^2 + X^3)$, or, equivalently, $(X^2 + D^3)$, in the
first line of (84), corresponds to the situation where, furthermore, we count complete
evolutions, in which, additionally, identities of individuals are taken into account. ■
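Both the weighted constant terms tabulated in (84) and the unweighted Duchon numbers come from the same walks on $\mathbb{Z}_{\ge 0}$, so they can be generated by a single routine. The Python sketch below is illustrative only; the names `closed_walks` and `duchon_formula` are ours, and the binomial sum coded in `duchon_formula` is the explicit expression as reconstructed in this note.

```python
from fractions import Fraction
from math import comb


def falling(x, r):
    """Falling factorial x (x-1) ... (x-r+1)."""
    out = 1
    for i in range(r):
        out *= x - i
    return out


def closed_walks(up, down, n, weighted):
    """Walks of n steps from 0 back to 0 with steps +up and -down, never below 0.

    With weighted=True a step -down taken from altitude k carries the weight
    k (k-1) ... (k-down+1), which gives C.T. of (X^up + D^down)^n; with
    weighted=False every step has weight 1.
    """
    level = {0: 1}
    for _ in range(n):
        new = {}
        for k, c in level.items():
            new[k + up] = new.get(k + up, 0) + c
            if k >= down:
                w = falling(k, down) if weighted else 1
                new[k - down] = new.get(k - down, 0) + c * w
        level = new
    return level.get(0, 0)


def duchon_formula(n):
    """Explicit binomial sum for the Duchon number of index 5n."""
    total = sum(
        Fraction(comb(5 * n + 1, n - i) * comb(5 * n + 2 * i, i), 5 * n + i + 1)
        for i in range(n + 1)
    )
    assert total.denominator == 1
    return int(total)
```

With `up=2, down=3` and `weighted=False` the routine counts the scenarios of the club (arrivals by pairs, departures in threesomes); with `weighted=True` it accounts for the identities of individuals, as explained at the end of the note.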



8. Related frameworks 

In this section, we return to the gates-and-diagram model of creation-annihilation
operators of Sections 2-6. We first discuss a model of the reduction
to normal form that is expressed in terms of rook placements on a board. This
rook model can be derived from first principles (Wick's Theorem, cf. Note 1, p. 9,
and [127]), but it can also be attached to the basic construction of diagrams and
the equivalence asserted by Theorem 1. As we explain in Subsection 8.1, one of
the interesting features of the latter approach is the possibility of relating diagrams
to the lattice-path methods of the previous section, Section 7; the connection is
achieved by a simple "scanning algorithm". Next, in Subsection 8.2, we briefly
revisit diagrams within the framework of $q$-analogue theory, where the $q$-difference
operator $\Delta$ replaces the ordinary differential operator $D$ and crossing numbers of
(plane embedded) diagrams are shown systematically to produce $q$-analogues.



8.1. Rook placements, lattice paths, and diagrams. We first describe some 
simple combinatorics that relates diagrams and rook placements on a chessboard. 
This thread closely follows an insightful article of Varvak [127]; see also [T31 II IT] .
The message here is that one can describe the complete history of the construction 
of diagrams by means of certain kinds of lattice configurations. 

We fix a basis $\mathcal{H}$ of gates that, for notational convenience, we take to be unweighted.
In other words, we are considering a polynomial with coefficients in
$\{0,1\}$,

$$\mathfrak{h} := \sum_{i=1}^{m} X^{r_i} D^{s_i},$$

for a finite set of distinct pairs $(r_i, s_i) \in \mathbb{Z}_{\ge 0} \times \mathbb{Z}_{\ge 0}$. (The general case is easily
treated by suitably accommodating weights.) Let $g_j$ represent a generic gate of
type $X^{r_j} D^{s_j}$. A diagram $\delta$ of size $n$ is determined by the collection $(g_{i_1}, \ldots, g_{i_n})$ of
its gates, where $g_{i_j}$ is the type of the gate associated to the inner node labelled $j$,
together with the interconnection pattern, which describes the way the outputs of
gates are connected with the inputs of some other (later arrived) gates.

FIGURE 17. The correspondence between the composition of a diagram and its contour,
interpreted in the discrete plane.



As we know (Note 1), a diagram is uniquely associated with a particular reduction
of a monomial

(85) $$X^{r_{i_n}} D^{s_{i_n}} \, X^{r_{i_{n-1}}} D^{s_{i_{n-1}}} \cdots X^{r_{i_1}} D^{s_{i_1}}$$

using at each stage either of the two rewrite rules $DX \mapsto XD$ or $DX \mapsto 1$. We
then define the contour of $\delta$ as a word over the extended alphabet $\{X, D, |\}$, where
"|" serves as a separator and is called a pin, as follows:

(86) $$\mathrm{cont}(\delta) := \; |\, X^{r_{i_n}} D^{s_{i_n}} \,|\, X^{r_{i_{n-1}}} D^{s_{i_{n-1}}} \,|\, \cdots \,|\, X^{r_{i_1}} D^{s_{i_1}}.$$

The contour in this sense is thus an unambiguous representation$^{22}$ of the gates that
$\delta$ is comprised of. Next we represent the contour as a polygonal path in the discrete
plane $\mathbb{Z} \times \mathbb{Z}$, with $X$ being interpreted as the vertical unit vector $(0,-1)$ and $D$
being the horizontal unit vector $(1,0)$. In the discrete plane, this polygonal line
determines what is known as a Ferrers board [25, 127]; see Figure 17, where the
pins are represented by arrows.

In order to obtain a bijective encoding of circuits, one needs to augment the
contour $\mathrm{cont}(\delta)$ so as to encode all the information relative to interconnections of
links in the diagram $\delta$. We now consider the Ferrers board whose upper envelope
is the contour. Since the order of application of operators in a monomial is from
the right, the contour (as of (85) or (86)) is scanned from the right. The procedure
is as follows (Figure 18):

In each vertical column (which corresponds to a letter $D$) do one of two
things:

- either put a single dot (a "rook") in one of the column's cells: a dot in
the $j$th cell from the bottom of the board means that the $j$th outgoing
link (conventionally starting from the right) present in the partial diagram
at this stage is connected to the currently active input link of the
current gate (this corresponds to the $D$ currently taken into account);

- or put nothing in the column: this corresponds to a dangling (unattached)
$D$-link.

A moment's reflection should convince our reader that at most one dot/rook can be
placed in each row (since the output of a gate can be "closed" at most once by the
input of a later gate), as well as in each column (by construction). This is precisely
the rule that constrains the placement of non-attacking rooks on a chessboard,
here in the case of a board with unconventional right and upper boundaries [111,
Ch. 7].

22 The pins serve to disambiguate the parsing of a word over the alphabet $\{X, D\}$. They are
needed in a few cases; for instance, if $\mathfrak{h}$ contains $(X + X^2)$, since $X^2 = X \cdot X$ can be parsed in
two different ways; or in the case of $(X + D + XD)$, since $X^2 D^2 = X \cdot XD \cdot D = X \cdot X \cdot D \cdot D$.
They are superfluous in cases such as $(X + D)$, $(X^2 + D)$, $(X^2 D^3)$, and so on, as considered by
Varvak [127].

FIGURE 18. The correspondence between a diagram and a rook placement
in the square lattice.

We now arrive at a general statement, which is a version adapted to our needs
of Varvak's Theorem 3.1 in [127].

Proposition 16. The coefficient of $X^a D^b$ in the normal form $\mathfrak{N}(\mathfrak{h}^n)$ is equal to the
number of rook placements in Ferrers boards, whose contour is consistent with $\mathfrak{h}^n$,
that have $a$ rook-free rows and $b$ rook-free columns.
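The left-hand side of Proposition 16, namely the coefficients of the normal form, can be computed directly from the commutation rule $DX = XD + 1$. The Python sketch below does only that; it does not enumerate rook placements. The function names are ours, and gates are given as plain words over the letters X and D, which is an encoding chosen for this illustration.

```python
from itertools import product


def normal_form(word):
    """Normal form of a word in X, D as a dict {(a, b): coefficient of X^a D^b}."""
    terms = {(0, 0): 1}
    for letter in reversed(word):          # multiply on the left, letter by letter
        new = {}
        for (a, b), c in terms.items():
            if letter == "X":
                new[(a + 1, b)] = new.get((a + 1, b), 0) + c
            elif letter == "D":
                # D X^a D^b = X^a D^(b+1) + a X^(a-1) D^b
                new[(a, b + 1)] = new.get((a, b + 1), 0) + c
                if a > 0:
                    new[(a - 1, b)] = new.get((a - 1, b), 0) + a * c
            else:
                raise ValueError("expected a word over the letters X and D")
        terms = new
    return terms


def normal_form_of_power(gates, n):
    """Normal form of (sum of the gates)^n, expanding over all words of n gates."""
    total = {}
    for choice in product(gates, repeat=n):
        for key, c in normal_form("".join(choice)).items():
            total[key] = total.get(key, 0) + c
    return total
```

For example, `normal_form("DDXX")` returns the three terms of $D^2X^2 = X^2D^2 + 4XD + 2$, and the entry `(0, 0)` of `normal_form_of_power(["X", "D"], 2 * n)` is the constant term discussed in Section 7.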

Varvak's proof essentially amounts to an appeal to Wick's Theorem (Note 1,
p. 9). The proof given above, without being radically different, amounts to applying
to diagrams a scanning algorithm whose general scheme is as follows:

Scanning algorithm. Gates are scanned in increasing order of their 
labels; gate types, encoded by corresponding vectors of Z x Z, are gen- 
erated (these form the steps of a "contour" read from the bottom right); 
additional information (a sequence of numbers) is supplied to specify the 
interconnection pattern of the new gate with its predecessors. 

A variety of encodings are possible, due to the flexibility of the coding conventions,
when implementing the scanning algorithm. For instance, regarding the
reduction of $(X + D)$, we may associate a northeast step ($\nearrow$) to an $X$ and a southeast
step ($\searrow$) to a $D$. The constant term in the normal form of $(X + D)^{2n}$ is then
associated with the collection of all Dyck paths of length $2n$, each such path being
augmented by a sequence of numbers that serves to encode the interconnection
pattern of gates in a particular $\{X, D\}$-diagram. In this specific case, it can be
seen that the allowed number sequences are such that an ascent has one possibility
whereas a descent has $\ell$ possibilities, if it corresponds to a step with initial altitude
$\ell$. The augmented paths produced by the scanning algorithm thus correspond
exactly to the weighted Dyck paths considered in Equation (81) and Proposition 14,
to be later revisited in Proposition 17 and Figure 21 in relation with $q$-analogues.
Beyond this particular example, the scanning algorithm generally links circuit-based
models and the direct matrix-Weyl graph approach of Section 7.

8.2. q-analogues and the difference operator. In this subsection, we propose
to discuss briefly the way the theory of gates and diagrams leads in a systematic
way to $q$-analogues. The starting point is the $q$-difference operator $\Delta \equiv \Delta_q$ defined
by

(87) $$\Delta f(x) = \frac{f(qx) - f(x)}{(q-1)\,x}.$$

We shall take $q$ to be a real number in $[0, 1]$ and note that, as $q \to 1$, the operator
$\Delta_q$ becomes the standard derivative operator $D$. The operators $X$ and $\Delta$ satisfy
the commutation relation

$$\Delta X - qX\Delta = 1,$$

to be compared to (1). A normal form, with all $X$s preceding all $\Delta$s, can always
be attained by a rewrite rule analogous to the one used in the classical case,

$$\Delta X \longrightarrow 1 + qX\Delta.$$

(The book by Kac and Cheung [76] provides an undemanding introduction to basic
properties of such operators.)

We shall now build in stages a combinatorial interpretation of arbitrary compositions
of $\Delta$s and $X$s. To start with, we observe the effect of $\Delta$ on (formal or
analytic) power series: if $f(x) = \sum_{n} f_n x^n$, then

(89) $$\Delta_q(f)(x) = \sum_{n \ge 0} f_n\,\frac{1 - q^n}{1 - q}\,x^{n-1}.$$

In other words, $\Delta$ operates linearly and, on the monomial $x^n$, its effect is to produce
a monomial of degree $(n - 1)$:

(90) $$\Delta x^n = [n]\,x^{n-1}, \quad \text{where} \quad [n] \equiv [n]_q := \frac{1 - q^n}{1 - q} = 1 + q + \cdots + q^{n-1},$$

using classical notations.

Combinatorics of $\Delta x^n$. From (90), the operator $\Delta$ admits an obvious interpretation:
think of the monomial $x^n$ as a row of $n$ occurrences of the variable $x$; pick
up (in all possible ways) one of the $x$-occurrences and record with a power $q^{k-1}$ the
situation where the $k$th occurrence from the left has been picked up; finally replace
the chosen occurrence of $x$ with the neutral element 1 (the identity). (Under this
form, it is apparent that the $\Delta$-operator is a deformation of the standard derivative
operator: see Note 1.)

A visual image of the action of the $\Delta$ operator on an arbitrary monomial can be
given in terms of "speed dating clubs" as follows.

Imagine a longish hall in which there is a long row of tables. At each table
there sits one $x$-element. A particular operation consists in letting in,
at the entrance of the hall, on the left, a $\Delta$-element, who will eventually
pick up a table. However, for each table that the $\Delta$-element passes by
(but does not pick up), he has to pay for a drink (the cost of drinks is
recorded by $q$). Once he settles for a table, the $x$-element at that table
ceases to be available for further drinks and solicitations. What
the $\Delta$ operator does is simply to keep track of all the possible scenarios,
when one $\Delta$-individual is let in.

For instance, with the example of Note 1 (a slashed letter marks the occurrence that is picked up):

$$\Delta(xxxx) = q^0\,\not{x}xxx + q^1\,x\not{x}xx + q^2\,xx\not{x}x + q^3\,xxx\not{x} = [4]_q\,x^3.$$
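Definition (87) and the monomial rule (90) can be compared numerically. The Python sketch below uses exact rational arithmetic; the function names (`q_int`, `q_diff`, `q_diff_by_definition`, `eval_poly`) and the representation of a polynomial by its list of coefficients are conventions chosen for this illustration.

```python
from fractions import Fraction


def q_int(n, q):
    """The q-integer [n]_q = 1 + q + ... + q^(n-1)."""
    return sum(q ** k for k in range(n))


def eval_poly(coeffs, x):
    """Value at x of the polynomial sum coeffs[n] * x^n."""
    return sum(c * x ** n for n, c in enumerate(coeffs))


def q_diff(coeffs, q):
    """Coefficients of Delta_q f, using Delta x^n = [n]_q x^(n-1) as in (90)."""
    return [q_int(n, q) * c for n, c in enumerate(coeffs)][1:]


def q_diff_by_definition(coeffs, q, x):
    """Value of Delta_q f at x, computed from the quotient (87); needs q != 1, x != 0."""
    return (eval_poly(coeffs, q * x) - eval_poly(coeffs, x)) / ((q - 1) * x)
```

Taking `q = Fraction(1, 2)` and the monomial $x^4$, both routes give $[4]_q\,x^3$, and `q_int(n, 1)` returns `n`, in line with the remark that $\Delta_q$ becomes $D$ as $q \to 1$.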

Combinatorics of $\Delta(x^n f) = \Delta X^n(f)$. In our treatment of operator calculus,
an identity $\mathfrak{U} = \mathfrak{V}$ between operators means that, for an arbitrary $f$ (on which
the operators act), we have $\mathfrak{U} f = \mathfrak{V} f$. Here, the nature of $f$ is immaterial and, in
particular, when dealing with normal forms, quantities such as $\Delta f, \Delta^2 f, \ldots$ are to be
considered as non-simplifiable. We can then amend the combinatorial interpretation
of the previous paragraph as follows. Imagine now that the longish hall has an
exit on the right, leading to a courtyard (biergarten) designated as "$f$", where $\Delta$
elements can accumulate if they haven't picked up an $x$-element in the hall. This
situation is seen to model the action of $\Delta$ on $(x^n f)$ or what amounts to the same,
the action of $\Delta X^n$ on an arbitrary $f$. Here is an example:

$$\Delta x^3 f = q^0\,\not{x}xx\,f + q^1\,x\not{x}x\,f + q^2\,xx\not{x}\,f + q^3\,xxx\,\Delta f = [3]_q\,x^2 f + q^3 x^3 \Delta f.$$

(As a simple exercise, the reader may wish to verify combinatorially the general
identity $\Delta(f \cdot g)(x) = \Delta f(x) \cdot g(x) + f(qx) \cdot \Delta g(x)$.)

Compositions. The interest of this visual image is that it describes well what
goes on upon iteration. For instance, we can see that

$$\Delta^n(x^n) = [n] \cdot [n-1] \cdots [2] \cdot [1] \equiv [n]!,$$

where the right hand side gives the generating function of permutations counted
according to the number of inversions$^{23}$; see Figure 19. This computation is also
nothing but the reduction of $\Delta^n X^n(1)$ to normal form.
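The identification of $[n]!$ with the inversion generating function can be confirmed by brute force for small $n$. In the Python sketch below (function names are ours), a polynomial in $q$ is stored as its list of coefficients, and inversions are counted on the one-line notation of a permutation, which gives the same distribution as the convention of the text.

```python
from itertools import permutations


def q_factorial(n):
    """Coefficient list (in q) of [n]! = [1]_q [2]_q ... [n]_q."""
    poly = [1]
    for m in range(1, n + 1):
        new = [0] * (len(poly) + m - 1)
        for i, c in enumerate(poly):
            for k in range(m):          # multiply by 1 + q + ... + q^(m-1)
                new[i + k] += c
        poly = new
    return poly


def inversion_distribution(n):
    """dist[k] = number of permutations of size n having exactly k inversions."""
    dist = [0] * (n * (n - 1) // 2 + 1)
    for p in permutations(range(n)):
        inv = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
        dist[inv] += 1
    return dist
```

The exhaustive enumeration is only practical for small sizes, which is enough to see the two coefficient lists coincide.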

More generally, given an arbitrary term in $X$ and $\Delta$, each possible expansion that
it can give rise to, when applied to an arbitrary $f$, can be described by a succession
of operations of adding a table with an $x$-girl$^{24}$ or launching a $\Delta$-boy into the game.

23 With our notations, the number of inversions is the number of pairs of values $(j, k)$ such
that $j < k$ and the value $j$ is placed on the right of the value $k$.

24 The operator $X$ clearly corresponds to placing a new table at the beginning of an existing
hall-courtyard configuration.

FIGURE 19. A permutation of size 4, such as $\sigma = \begin{pmatrix} 1 & 2 & 3 & 4 \\ 3 & 2 & 4 & 1 \end{pmatrix}$,
where the first value 1 goes to place 3, and so on, corresponds to a
particular expansion of $\Delta^4[x^4]$: it is seen that the number of crossing
links equals the number of inversions in the permutation. (In the figure, values
1 to 4 are listed vertically and places (1) to (4) horizontally.)


Diagrams. The discussion above leads to a natural extension of the notion
of diagram. A quantity $X^r \Delta^s$ will again be represented by a gate in the sense
of Section 2, Equation (5). However, when gates are composed to form graphs, a
definite convention should be observed.

All edges are drawn as segments or half-lines parallel to the axes. Each
new gate $\gamma$, which is added to an existing diagram $\delta$ (where the latter
involves only smaller labels) is placed on the north-west of the diagram.
The inputs of the gate $\gamma$ are drawn horizontally, pointing to the right; the
outputs are drawn vertically, pointing upwards. Inputs of $\gamma$ not connected
to an output of $\delta$ are prolonged as half-lines to the right of the
diagram. Outputs of $\gamma$ not connected to a later gate's input are prolonged
upwards as half-lines. See Figure 20.

This convention corresponds to the fact that, in a gate of type (say) $X^2 \Delta^3$, the first
input corresponds to a first application of $\Delta$, and so on. A diagram represented in
this way will be called an embedded diagram (Figure 20). The number of pairs of
edges that cross is called the crossing number of the embedded diagram.

The observations relative to the combinatorics of $\Delta$ and the case of permutations
then immediately lead to the following statement.

Theorem 2 ($\Delta$-Equivalence Principle). Consider a polynomial $\mathfrak{h}$ in normal form

(91) $$\mathfrak{h} := \sum_{(r,s) \in \mathcal{H}} w_{r,s}\,X^r \Delta^s.$$

Then the normal ordering of the power $\mathfrak{h}^n$,

(92) $$\mathfrak{N}(\mathfrak{h}^n) = \sum_{a,b} c_{n,a,b}(q)\,X^a \Delta^b,$$

is such that the polynomial $c_{n,a,b}(q)$ coincides with the total weight of (labelled)
embedded diagrams that admit $\mathcal{H}$ as a basis weighted by $w$, have size $n$, and are
comprised of $a$ outputs and $b$ inputs, where the variable $q$ marks the number of
crossings.

FIGURE 20. Embedded diagrams formed with two gates of type $X^2$ and
two gates of type $\Delta^2$: (left) a diagram with 4 crossings and $a = b = 0$;
(right) a diagram with 5 crossings and $a = b = 1$.



In the case of the reduction of expressions of type $X^{r_1} \Delta^{s_1} \cdots X^{r_n} \Delta^{s_n}$, Mendez
and Rodriguez [98] developed an interpretation similar to ours, in terms of crossings.
This theme is further explored by Mansour, Schork, and Severini [92] in the context
of "Wick's formula". We also refer to the early work of Katriel, Kibler, Solomon,
and Duchamp [HUES [S3] and to the subsequent studies [IS H3 E3 EEH] for results
relative to the $q$-Stirling and Bell numbers.

The interest of previous developments, where the $\Delta$ difference operator replaces
the usual derivative $D$, is that they systematically lead to a natural class of $q$-analogues,
which may or may not be classical. The case of Stirling numbers ($(X\Delta)$
and generalizations) being well covered in the literature, we limit ourselves here
to examining two cases: $(X + \Delta)^n$, which leads to a $q$-analogue of the involution
numbers (§3.1 above); $(X^2 + \Delta^2)^n$, which is related to alternating permutations
(§5.1 above). Our brief treatment will be along the lines of Section 7, dedicated
to binomial forms $(X^a + D^b)^n$, with emphasis placed on "constant terms" and
continued fraction aspects. We state (cf. Proposition 15):

Proposition 17. Consider the constant term of the normal form of (X + A)' 

I n (q) :=C.T.yi[(X + A) n ]. 

Its ordinary generating function satisfies 

(93) ' - 



n>0 



1 



[1], 



[2], 



1 - 



[3] 



Furthermore, there exists an explicit form for I n (q), 



(94) 



1 " 



fc(fc-l)/2 



2n 
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FIGURE 21. Three representations of the involution 

a = {(1 4), (2 6), (3 7), (5 8)}. 

From top to bottom: (a) the embedded diagram; (b) the associated 
Hermite history; (c) the chord diagram. The number of crossings 
equals 5. 



Proof. First, the embedded diagrams associated with the reduction of powers of
$(X+\Delta)$ are similar to the ones for permutations (but the distribution of integer
labels differs): see Figure 21 and compare with Figure 19. Under this form, it is easily
recognized that the embedded diagrams are isomorphic to the arch representations
of involutions, themselves in bijective correspondence with "Hermite histories"; that
is, weighted Dyck paths where a descent from altitude j has multiplicity j.

To see the bijective correspondence$^{25}$ with Hermite histories, start from the
embedded diagram (Figure 21(a)). Scan the values from 1 to $n = 2\nu$ and associate
an ascent to a value that is smaller in its cycle, a descent otherwise. This gives
rise to a Dyck path exemplified by Figure 21(b). When a descent from altitude k
is produced, say at time r, there are k possible choices for the 2-cycle that could
be closed, the possibilities being enumerated by $[k]_q$. Number conventionally these
possibilities according to "age" (oldest first); that is, the open cycle with largest
element is numbered 0, the second oldest is numbered 1, and so on. The rank of
the possibility chosen appears to be equal to the number of crossings that the cycle
ending at r has with its preceding cycles. Thus, Dyck paths where any ascent is



25 This correspondence is classical and due to Françon and Viennot (see, e.g., [45], [55] or the
accounts in the books [51, p. 333] and [59, §5.2]).






weighted by 1 and a descent from altitude k is weighted by $[k]_q$ are in bijective
correspondence with involutions, where each crossing is weighted by q. The expansion
(93) now results from the basic theorem of continued fraction combinatorics [Th. 1].

The explicit form (94) is none other than the celebrated Touchard-Riordan formula
[108], [110], [125] in the enumeration of chord crossings: see Figure 21(c). □
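The three descriptions used in this proof can be compared numerically. The following Python sketch is an illustration only: the function names are ours, q is taken to be an integer different from 1 so that all arithmetic is exact, and the Touchard-Riordan sum is written in its usual form with k ranging from -n to n. It computes the q-weighted count of involutions without fixed points on 2n points in three ways: by weighted Dyck paths (Hermite histories), by the closed form, and by brute-force enumeration of chord crossings.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def q_int(k, q):
    """q-analogue [k]_q = 1 + q + ... + q^(k-1), evaluated at the number q."""
    return sum(q ** i for i in range(k))


def hermite_history_sum(n, q):
    """Total weight of Dyck paths of length 2n: ascents weigh 1,
    a descent from altitude k weighs [k]_q (dynamic programming on altitude)."""
    weights = {0: 1}
    for _ in range(2 * n):
        nxt = {}
        for alt, w in weights.items():
            nxt[alt + 1] = nxt.get(alt + 1, 0) + w
            if alt > 0:
                nxt[alt - 1] = nxt.get(alt - 1, 0) + w * q_int(alt, q)
        weights = nxt
    return weights.get(0, 0)


def touchard_riordan(n, q):
    """Closed form (1-q)^(-n) * sum_{k=-n}^{n} (-1)^k q^(k(k-1)/2) C(2n, n+k), q != 1."""
    total = sum((-1 if k % 2 else 1) * q ** (k * (k - 1) // 2) * comb(2 * n, n + k)
                for k in range(-n, n + 1))
    return Fraction(total, (1 - q) ** n)


def matchings(points):
    """All perfect matchings of a sorted list of points, as lists of pairs (a, b), a < b."""
    if not points:
        yield []
        return
    first, rest = points[0], points[1:]
    for i, partner in enumerate(rest):
        for m in matchings(rest[:i] + rest[i + 1:]):
            yield [(first, partner)] + m


def crossings(matching):
    """Number of pairs of chords (a, b), (c, d) that cross."""
    return sum(1 for (a, b), (c, d) in combinations(matching, 2)
               if a < c < b < d or c < a < d < b)


def crossing_sum(n, q):
    """Brute force: sum of q^(number of crossings) over all matchings of 1..2n."""
    return sum(q ** crossings(m) for m in matchings(list(range(1, 2 * n + 1))))
```

For instance, `crossings([(1, 4), (2, 6), (3, 7), (5, 8)])` recovers the 5 crossings of the involution of Figure 21, and for n = 2 all three functions give q + 2.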
Similarly, we have: 

Proposition 18. The constant terms $I^{(2)}_n(q)$ of the normal form of $(X^2+\Delta^2)^n$
satisfy

(95) $$\sum_{n\ge 0} I^{(2)}_n(q)\,z^n = \cfrac{1}{1-\cfrac{[1]_q[2]_q\,z^2}{1-\cfrac{[3]_q[4]_q\,z^2}{1-\cfrac{[5]_q[6]_q\,z^2}{\ddots}}}}.$$

This is a q-generalization of the continued fraction attached to $1/\sqrt{\cos z}$, for
which the authors have not found an explicit form; perhaps some of the methods
developed by Josuat-Vergès [TTJ [73] [73] could be relevant.$^{26}$ Also, it would seem
interesting to elicit possible connections with q-analogs of expansions of trigonometric
functions, as considered by Prodinger [106], [107]. At any rate, we can observe
that the continued fraction expansions provided by Propositions 17 and 18 are not
devoid of content: they can in particular be employed to derive useful asymptotic
information, as is exemplified by several studies [13 [EH E7J [SS].

9. Multivariate schemes. 

The principles underlying the construction of circuits by means of gates can
be extended painlessly to certain multivariate calculi and we provide here brief
indications to that effect. Algebraically, we now consider a family of operators
$A_1,\ldots,A_r$, $B_1,\ldots,B_r$, satisfying the partial commutation relations (expressed in
terms of the Lie bracket $[\cdot,\cdot]$):

(96) $$[A_j,B_j] = 1 \quad (j=1,\ldots,r); \qquad [A_j,A_k] = [A_j,B_k] = [B_j,B_k] = 0 \quad (j\ne k).$$

A faithful model is that of (multivariate) differential algebra where we interpret
the operators as acting on functions $f(x_1,\ldots,x_r)$, take $A_j$ to be the jth partial
derivative, $A_j := \partial/\partial x_j$, and $B_j$ to be multiplication by $x_j$, that is, $B_jf := x_jf$. We
shall adopt this suggestive interpretation and write $D_j$ instead of $A_j$ and $X_j$ instead
of $B_j$. (In concrete examples, we may also name variables and use, for instance, in
the case of variables $\{x,y\}$, the notations $\partial_x$, $\partial_y$, $X$, $Y$.) Obviously, any polynomial
in the operators $X_j$, $D_j$ has a normal form in which all the $X$s precede all the $D$s.
Furthermore, we may freely adopt the additional convention that variables obey a
standard ordering $x_1 \prec x_2 \prec x_3 \prec \cdots$, so that $X_j$ will be systematically written
before $X_k$ if $j<k$, and similarly for $D_j$ and $D_k$.

The idea is now simply to construct decorated gates, which are gates as considered
before, with the additional characteristics that each incoming and each outgoing



26 In a recent paper H. Shin and J. Zeng have given an explicit formula for $I^{(2)}_n(q)$ in terms of a
double sum, see Thm. 12 in [115].
vertex is tagged with a variable name (or its index). We shall also agree that the
tags, in left to right order, follow the standard ordering of variables. Thus for
instance, the gate associated with XlX|Z?JI?2 has one inner node connected to 
three ingoing edges and five outgoing edges: 



X 1 XlDfD 2 : 




(A tag $x_j$ on an input signifies that derivation is to be effected with respect to the
corresponding variable, i.e., $\partial_{x_j}$, also abbreviated as $D_j$.) To define diagrams, the
rule is now the following: Outputs of a gate can be connected to inputs of another
gate if and only if they have identical variable tags. Equivalently, the edges of a
graph bear various colours (i.e., tags), and when composing an existing diagram
with a gate, colours of connecting links must match.
We then have an easy generalization of Theorem 1:



Proposition 19. The coefficient of

$$X_1^{a_1}\cdots X_r^{a_r}\,D_1^{b_1}\cdots D_r^{b_r}$$

in the normal form $\mathfrak{N}(\mathfrak{h}^n)$ is the number (total weight) of (tagged, coloured) graphs
built out of gates associated with the monomials of $\mathfrak{h}$, that are comprised of n gates
and have $a_j$ outputs of type $x_j$ and $b_j$ inputs of type $x_j$, for all $j = 1,\ldots,r$.



In the physics literature, multivariate normal forms corresponding to (96) are
often referred to as "multimode"; see, e.g., [Tl |9"5I [T5!2]. Explicit forms are likely to
be quite rare, due to the additional complexity introduced by the need to match
colours. We content ourselves with a few examples that are of (some) combinatorial
significance.

Note 23. The combinatorics of $x\partial_y + y\partial_x$ and the Ehrenfest model. Consider the operator

$$\Gamma := x\frac{\partial}{\partial y} + y\frac{\partial}{\partial x},$$

which serves as a simple illustration of the combinatorial approach to partial differential
operators via diagrams. Here the gates associated with the operator $\Gamma$ are of two sorts:

[Figure: the two gates, associated with $x\partial_y$ and $y\partial_x$ respectively.]

We first state:

Proposition 20. The operator $\Gamma$ satisfies the identity

(97) $$e^{z\Gamma}f(x,y) = f(x\cosh z + y\sinh z,\; x\sinh z + y\cosh z).$$
FIGURE 22. A particular diagram associated with $(x\partial_y + y\partial_x)$.



Proof. A single application of $\Gamma$ to $f(x,y)$ picks up the occurrence of a variable, x or
y, and either replaces the x with a y, or the y with an x. The diagrams are of the kind
constructed in Subsection 4.1 and relative to the normal form of $(XD)$: they consist of line
graphs, which are sequences of edges (drawn vertically, as in Figure [?]). However, edges
are now of one of two colours (x or y), with the additional constraint that the colours
along each linear path alternate, as in xyxy... or yxyx...; see the illustration in Figure 22.

Now comes the combinatorial reasoning that justifies (97). By linearity, it suffices to
consider the action of $e^{z\Gamma}$ on a generic monomial. First, we claim the identities

$$e^{z\Gamma}x = (x\cosh z + y\sinh z), \qquad e^{z\Gamma}y = (y\cosh z + x\sinh z).$$

This corresponds to the fact$^{27}$ that a connected diagram is a line graph (as in
Subsection 4.1), whose input and output are tagged by the same letter (either xx or yy) if the
graph has an even size, but by different letters (either xy or yx) if the graph has an odd
size. The formula

(98) $$e^{z\Gamma}(x^{a_0}y^{b_0}) = (x\cosh z + y\sinh z)^{a_0}\,(y\cosh z + x\sinh z)^{b_0}$$

then results from the general property that the product of EGFs enumerates all possible
distributions of labels (marked by z) among components. □
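Identity (98) lends itself to a mechanical check on small cases. The Python sketch below is only an illustration under our own conventions: a polynomial in x, y is stored as a dictionary mapping exponent pairs (i, j) to coefficients, and all function names are ours. It applies the operator $x\partial_y + y\partial_x$ n times to a monomial and compares the result with n! times the coefficient of $z^n$ in the right-hand side of (98), computed from truncated series for cosh and sinh.

```python
from fractions import Fraction
from math import factorial


def apply_gamma(p):
    """Apply x d/dy + y d/dx to a polynomial {(i, j): coeff} in x^i y^j."""
    out = {}
    for (i, j), c in p.items():
        if j > 0:  # x d/dy: x^i y^j -> j x^(i+1) y^(j-1)
            out[(i + 1, j - 1)] = out.get((i + 1, j - 1), 0) + c * j
        if i > 0:  # y d/dx: x^i y^j -> i x^(i-1) y^(j+1)
            out[(i - 1, j + 1)] = out.get((i - 1, j + 1), 0) + c * i
    return out


def gamma_power(a0, b0, n):
    """Gamma^n applied to the monomial x^a0 y^b0."""
    p = {(a0, b0): 1}
    for _ in range(n):
        p = apply_gamma(p)
    return p


def poly_mul(p, q):
    """Product of two polynomials in the dictionary representation."""
    out = {}
    for (i1, j1), c1 in p.items():
        for (i2, j2), c2 in q.items():
            key = (i1 + i2, j1 + j2)
            out[key] = out.get(key, 0) + c1 * c2
    return out


def series_mul(s, t, order):
    """Product of two power series in z (lists of polynomials), truncated at z^order."""
    out = [dict() for _ in range(order + 1)]
    for k, pk in enumerate(s):
        for l, pl in enumerate(t):
            if k + l <= order:
                for key, c in poly_mul(pk, pl).items():
                    out[k + l][key] = out[k + l].get(key, 0) + c
    return out


def closed_form(a0, b0, n):
    """n! [z^n] (x cosh z + y sinh z)^a0 (y cosh z + x sinh z)^b0."""
    # x cosh z + y sinh z: even powers of z carry x, odd powers carry y.
    fx = [{((1, 0) if k % 2 == 0 else (0, 1)): Fraction(1, factorial(k))}
          for k in range(n + 1)]
    # y cosh z + x sinh z: even powers of z carry y, odd powers carry x.
    fy = [{((0, 1) if k % 2 == 0 else (1, 0)): Fraction(1, factorial(k))}
          for k in range(n + 1)]
    result = [{(0, 0): Fraction(1)}] + [dict() for _ in range(n)]
    for _ in range(a0):
        result = series_mul(result, fx, n)
    for _ in range(b0):
        result = series_mul(result, fy, n)
    return {key: c * factorial(n) for key, c in result[n].items()}
```

With this representation, `gamma_power(a0, b0, n) == closed_form(a0, b0, n)` is expected to hold for all small nonnegative a0, b0, n; the check is of course no substitute for the combinatorial proof.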

From a combinatorial point of view, what has been done amounts to enumerating
ordered partitions into m (possibly empty) blocks, with x and y recording the parity of
each block. In the particular case where $b_0 = 0$, $a_0 = m$, and $y = 0$, we obtain the
univariate EGF $\cosh^m z$, which enumerates ordered partitions whose blocks are all of even
size (a well-known and easy result).

The analysis above yields as a byproduct a nonstandard analysis of the classical
Ehrenfest model [3D] (Figure 23), which is defined as follows.

27 Notice the classical expansions $\cosh(z) = \sum_{n\equiv 0 \bmod 2} z^n/n!$ and $\sinh(z) = \sum_{n\equiv 1 \bmod 2} z^n/n!$.

FIGURE 23. Modelling of the Ehrenfest urn by the partial differential
operator $x\partial_y + y\partial_x$.



Ehrenfest Model. There are two communicating chambers A, B and m distinguishable
particles (say, numbered from 1 to m). At any given instant 1, ..., n,
a particle is randomly chosen to change chamber.

The problem consists in determining the probability $P_m(n; a_0, a)$ of having, at time n,
a particles in chamber A knowing that there are $a_0$ particles in that chamber at time 0.
(The Ehrenfest Model is a simple statistical model of the diffusion of particles or heat in
a heterogeneous environment.)

Kac [75] provides a complete solution based on matrix algebra and ordinary generating
functions. Here, we may simply observe that, by virtue of (98), the probability is

$$P_m(n; a_0, a) = n!\,m^{-n}\,[x^a y^b z^n]\,(x\cosh z + y\sinh z)^{a_0}\,(y\cosh z + x\sinh z)^{b_0},$$

where $b_0 = m - a_0$ and $b = m - a$. (The factor $m^{-n}$ transforms counts of sample paths
into probabilities; the factor $n!$ is due to the fact that we deal with EGFs.) It is then a
simple matter to perform coefficient extraction, by successive binomial expansions.

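Proposition 21. The transition probabilities of the Ehrenfest model are given by

$$P_m(n; a_0, a) = \frac{1}{2^m m^n}\sum_{k=0}^{m} \lambda_k^{(a_0,\,m-a_0)}\,\lambda_{m-a}^{(m-k,\,k)}\,(m-2k)^n,
\qquad\text{where}\quad \lambda_n^{(\alpha,\beta)} := [z^n]\,(1+z)^{\alpha}(1-z)^{\beta}.$$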
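The coefficient extraction can be made concrete as follows. Writing cosh and sinh in terms of $e^{z}$ and $e^{-z}$, the product in (98) becomes $2^{-m}\sum_k \lambda_k^{(a_0,b_0)}(x+y)^{m-k}(x-y)^k e^{(m-2k)z}$, from which a finite sum for the transition probabilities follows. The Python sketch below (function names are ours, and the finite sum is our own rewriting of the extraction, stated here as an assumption to be tested rather than as a quotation) compares that sum with a direct computation on the underlying Markov chain, using exact rational arithmetic.

```python
from fractions import Fraction
from math import comb


def lam(k, alpha, beta):
    """Coefficient [z^k] (1+z)^alpha (1-z)^beta, for integers alpha, beta >= 0."""
    return sum(comb(alpha, i) * comb(beta, k - i) * (-1) ** (k - i)
               for i in range(k + 1))


def ehrenfest_formula(m, n, a0, a):
    """Transition probability obtained by coefficient extraction from (98)."""
    total = sum(lam(k, a0, m - a0) * lam(m - a, m - k, k) * (m - 2 * k) ** n
                for k in range(m + 1))
    return Fraction(total, 2 ** m * m ** n)


def ehrenfest_markov(m, n, a0, a):
    """Same probability, by iterating the one-step transition rule n times."""
    dist = [Fraction(0)] * (m + 1)
    dist[a0] = Fraction(1)
    for _ in range(n):
        new = [Fraction(0)] * (m + 1)
        for j, p in enumerate(dist):
            if p:
                if j > 0:   # one of the j particles in chamber A moves to B
                    new[j - 1] += p * Fraction(j, m)
                if j < m:   # one of the m - j particles in chamber B moves to A
                    new[j + 1] += p * Fraction(m - j, m)
        dist = new
    return dist[a]
```

For example, with m = 2 particles both starting in chamber A, the probability of finding both in A again after two steps is 1/2 by either route.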



The derivation just given of the solution to the Ehrenfest model seems to us as
"conceptual" as can be. It has strong similarities with the one given by Goulden and Jackson
in [60]. It is further developed in several other papers [39], [46], [48] and in the book [51,
pp. 118 and 530].

We next discuss a class of bivariate first-order operators, given by Equation (100)
below, for which the normal form problem is solvable in finite terms (Proposition 23).
Following [46], Figure 24 lists some representative operators for which
the connection with urn processes detailed in Note 24 provides explicit forms.
There are also two interesting papers [36], [37] published by Dumont in Séminaire
Name: Polya
Operator ($\Gamma$): $x^2\partial_x + y^2\partial_y$
Model: Models propagation of epidemics or genes (Polya-Eggenberger). Admits separation
of variables and corresponds to shuffles of 1-dimensional histories.
References: [irfl IBlfl 1501 RTf].

Name: Friedman
Operator ($\Gamma$): $xy\,\partial_x + xy\,\partial_y$
Model: Friedman's model of "adverse safety campaign". Associated with Eulerian numbers
and rises in perms.
References: [37], [15], [56], [68], [90], [91].

Name: Ehrenfest
Operator ($\Gamma$): $y\,\partial_x + x\,\partial_y$
Model: Ehrenfest model of the diffusion of particles. Associated with ordered set
partitions and parity of blocks.
References: Note 23 and 1151 15T1 l68l 1ST].

Name: Coupon collector
Operator ($\Gamma$): $y\,\partial_x + y\,\partial_y$
Model: Models the classical coupon collector problem. Associated with surjections,
ordered set partitions, and Stirling$_2$ numbers.
References: [46], [51], [68], [91].

Name: Sampling I
Operator ($\Gamma$): $x\,\partial_x + y\,\partial_y$
Model: Sampling with replacement. Admits separation of variables.
References: [46l l68l IM).

Name: Sampling II
Operator ($\Gamma$): $\partial_x + \partial_y$
Model: Sampling without replacement. Admits separation of variables.
References: [46l 1551 M\.

Name: Records
Operator ($\Gamma$): $x^2\partial_x + xy\,\partial_y$
Model: Models records in permutations and is associated with Stirling$_1$ numbers.
Describes the growth of the rightmost branch in a binary increasing tree.
References: [46].

Name: Bimodal-2
Operator ($\Gamma$): $y^2\partial_x + x^2\partial_y$
Model: Models a (binary) chain reaction with two types of particles: $x \mapsto y^2$,
$y \mapsto x^2$. Corresponds to parity of levels in increasing binary trees and models
fringe-balanced trees. Is solved in terms of Dixonian elliptic functions sm, cm.
References: [271 liSl 1171 H04].

Name: Bimodal-3
Operator ($\Gamma$): $y^3\partial_x + x^3\partial_y$
Model: Models a (ternary) chain reaction with two types of particles: $x \mapsto y^3$,
$y \mapsto x^3$. Corresponds to parity of levels in increasing ternary trees and is solved in
terms of lemniscatic elliptic functions sl, cl.
References: [46111?].

FIGURE 24. A list of some first-order bivariate operators (100) associated
with solvable urn models and explicit normal forms, together with
the main characteristics of the models, after [46].



Lotharingien de Combinatoire in 1986 and 1996 that deal with combinatorial as- 
pects of powers of special linear partial differential operators: the later one [37] is 
in particular based on Chen grammars that can be regarded as a non-probabilistic 
version of urn processes. 
Note 24. Polya urn models with two colours. Here is a specification of this model, also
sometimes known as Polya-Eggenberger model [68], [91].

Polya Urn. An urn may contain balls of different colours. A fixed set of replacement
rules is given (one for each colour). At any discrete instant, a ball is chosen
uniformly at random, its colour is inspected, and the corresponding replacement
rule is applied.

The modeling capability of the Polya Urn process is stupendous and the literature on the
subject, mostly probabilistic, is immense. Here, we follow the purely combinatorial line
of Flajolet et al. [46], [47], and only consider models with two colours (say, x and y). A
model is then determined by a 2 × 2 matrix with integer entries:

$$M = \begin{pmatrix}\alpha & \beta\\ \gamma & \delta\end{pmatrix}, \qquad \alpha,\beta,\gamma,\delta \in \mathbb{Z}.$$

At any instant, if a ball of the first colour is drawn, then it is placed back into the urn
together with $\alpha$ balls of the first colour and $\beta$ balls of the second colour; similarly, when a
ball of the second colour is drawn, with $\gamma$ balls of the first colour and $\delta$ balls of the second
colour. A figurative rendition of this replacement rule that we may use occasionally is

$$x \to x^{\alpha}y^{\beta}; \qquad y \to x^{\gamma}y^{\delta}.$$

(Some authors prefer an additive notation, such as $x \mapsto \alpha x + \beta y$, which is evocative
of chemical reactions.) For instance, the Ehrenfest model of Note 23 is rendered by the
following matrix and rule:

$$\mathcal{E} = \begin{pmatrix}-1 & 1\\ 1 & -1\end{pmatrix}, \qquad (x \to x^{-1}y;\ \ y \to xy^{-1}).$$

(Negative diagonal entries $\alpha$, $\delta$ mean that balls are taken out of the urn, rather than added
to it.) We henceforth restrict attention to balanced urns, which are such that there exists
a number s, called the balance, such that

(99) $$s = \alpha + \beta = \gamma + \delta.$$

Given an urn initialized with $a_0$ balls of the first colour and $b_0$ balls of the second
colour, what is sought$^{28}$ is the multivariate generating function $H(x,y,z)$ (of exponential
type), such that $n!\,[z^n x^a y^b]\,H(x,y,z)$ is the number of possible evolutions (also known
as "histories" or "sample paths") of the urn leading at time n to an urn with colour
composition (a, b). Let $t_0 = a_0 + b_0$ be the initial size of the urn. For $s \ge 1$, the total
number of evolutions is clearly $t_0(t_0+s)\cdots(t_0+(n-1)s)$, so that $H(1,1,z) = (1-sz)^{-t_0/s}$.

Let the formal monomial $\mathfrak{m} = x^a y^b$ represent a particular urn comprised of a balls of
type x and b balls of type y. From our introductory discussion of the combinatorics of
derivatives in Note 1, we can see that the effect of all the one-step evolutions of the urn
is described by the linear partial differential operator

(100) $$\Gamma = x^{\alpha+1}y^{\beta}\frac{\partial}{\partial x} + x^{\gamma}y^{\delta+1}\frac{\partial}{\partial y}, \qquad \alpha+\beta = \gamma+\delta,$$

applied to $\mathfrak{m}$. In the same way, $\Gamma^2$ generates all two-step evolutions, and so on. We then
have an easy observation [47]:

Proposition 22. The EGF of histories of a balanced urn with two colours and initial
composition $(a_0, b_0)$ satisfies

$$H(x,y,z) = e^{z\Gamma}(x^{a_0}y^{b_0}),$$

where $\Gamma$ is the operator specified in (100).
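The content of Proposition 22 can be explored experimentally by iterating the operator on a monomial. The following Python sketch is a minimal illustration, assuming the same dictionary representation of polynomials as a map from exponent pairs (a, b) to coefficients; the names `urn_step` and `histories` are ours. Each application of the operator replaces $x^a y^b$ by a copies of the urn obtained after drawing a ball of the first colour plus b copies of the urn obtained after drawing a ball of the second colour.

```python
from math import factorial


def urn_step(p, alpha, beta, gamma, delta):
    """Apply x^(alpha+1) y^beta d/dx + x^gamma y^(delta+1) d/dy to {(a, b): count}."""
    out = {}
    for (a, b), c in p.items():
        if a > 0:  # a ways to draw a ball of the first colour
            key = (a + alpha, b + beta)
            out[key] = out.get(key, 0) + c * a
        if b > 0:  # b ways to draw a ball of the second colour
            key = (a + gamma, b + delta)
            out[key] = out.get(key, 0) + c * b
    return out


def histories(matrix, a0, b0, n):
    """Number of length-n histories from (a0, b0), indexed by final composition."""
    alpha, beta, gamma, delta = matrix
    p = {(a0, b0): 1}
    for _ in range(n):
        p = urn_step(p, alpha, beta, gamma, delta)
    return p


def total_histories(t0, s, n):
    """Product t0 (t0 + s) ... (t0 + (n-1) s), the total number of histories."""
    total = 1
    for i in range(n):
        total *= t0 + i * s
    return total
```

For the Polya urn with matrix entries (1, 0, 0, 1) started from one ball of each colour, `histories((1, 0, 0, 1), 1, 1, n)` assigns the same count n! to each of the n + 1 reachable compositions, which reflects the classical fact that the composition is then uniformly distributed.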



28 For balanced urns (only), there is complete equivalence between probabilistic analysis and
enumeration of histories [46].
In other words, the solution to the urn problem is equivalent to the reduction to normal
form of the powers $\Gamma^n$ (equivalently, the exponential $e^{z\Gamma}$) of the associated first-order
partial differential operator $\Gamma$. It is noteworthy that, under the conditions (two-coloured and
balanced), this normal ordering problem is solvable. The treatment consists in associating
an ordinary differential system,

(101) $$\Sigma:\ \left\{\ \frac{d}{dt}X(t) = X(t)^{\alpha+1}Y(t)^{\beta},\quad \frac{d}{dt}Y(t) = X(t)^{\gamma}Y(t)^{\delta+1}\ \right\},$$

which admits a first integral that is a polynomial (namely, $Y^p - X^p = 1$, with $p = \gamma - \alpha$)
and which can be explicitly related to the multivariate EGF $H(x,y,z)$. We quote from [46,
Th. 2].

Proposition 23. Two-coloured balanced urns and the associated normal form problem
for the operator $\Gamma$ in (100) are solvable by quadrature;$^{29}$ in the case$^{30}$ $\alpha < 0$, one has



\e zT (x°V )] = A*°S(-azA s + J(aT a A a )) °° C (~azA s + J(x~ a A a )) "° . 
The notations are 

s = a + (3, p = /3-a;, t = a + b , 
as well as A = A(x) = (1 - x p ) 1/p , and J(u) := J — ^ p/a y //s ■ The function S(z) 

is defined as the inverse of J(u), namely, S(J{u)) — J{S{u)) — u; the function C(z) is 
given by C{z) = (1 + S(z)- p/a ) s . 

Though the general formula looks rather formidable, great simplifications occur in many 
models of practical interest and the expressions lend themselves to precise asymptotic 
analysis, as detailed in [35] ■ 

Note 25. First-order differential operators, multitype trees, and the method of
characteristics. The object of interest here is the operator

$$\Lambda = \sum_{j=1}^{m}\phi_j(X_1,\ldots,X_m)\,D_j + \rho(X_1,\ldots,X_m).$$

What we do here is to extend the univariate framework of the semilinear case of Section 6
to m variables. The combinatorial model will once more be expressed in terms of increasing
trees. We shall write $\mathbf{x}$ for the vector of variables $(x_1,\ldots,x_m)$. We state:

Proposition 24. The exponential $e^{z\Lambda}$ admits a normal form

$$e^{z\Lambda} = e^{R(\mathbf{X},z)}\ {:}\exp\Big(\sum_{j=1}^{m}T_j(\mathbf{X},z)\,D_j\Big){:}$$

where the functions $T_j$ and $R$ satisfy the ordinary differential system

(102) $$\begin{cases}
\dfrac{\partial}{\partial z}T_1(\mathbf{x},z) = \phi_1\big(x_1+T_1(\mathbf{x},z),\ldots,x_m+T_m(\mathbf{x},z)\big), & T_1(\mathbf{x},0)=0\\
\qquad\vdots\\
\dfrac{\partial}{\partial z}T_m(\mathbf{x},z) = \phi_m\big(x_1+T_1(\mathbf{x},z),\ldots,x_m+T_m(\mathbf{x},z)\big), & T_m(\mathbf{x},0)=0,
\end{cases}$$


29 It can be seen that the variable y is redundant, so that the substitution $y \mapsto 1$ entails no loss
of generality.

30 Among negative values, only $\alpha = -1$ is of direct relevance to the normal ordering problem in
its standard form, since, in the present study, negative powers of X are excluded. The case $\alpha > 0$
seems to give rise to similar developments (B. Morcrette, work in progress, July 2010).
Name: Polya
Operator ($\Gamma$): $w^2\partial_w + x^2\partial_x + y^2\partial_y$
Model: Models propagation of genes. Admits separation of variables and corresponds
to shuffles of 1-dimensional histories.
References: [l6ll68ll9Tl.

Name: C-Ehrenfest
Operator ($\Gamma$): $x\,\partial_w + y\,\partial_x + w\,\partial_y$
Model: Models particles in a cycle of three chambers. Associated with ordered
set partitions and congruence properties of block sizes.
References: [46].

Name: S-Ehrenfest
Operator ($\Gamma$): $(x+y)\,\partial_w + (y+w)\,\partial_x + (w+x)\,\partial_y$
Model: Models particles in a symmetric set of three chambers.
References: [46].

Name: Coupon
Operator ($\Gamma$): $x\,\partial_w + y\,\partial_x + y\,\partial_y$
Model: Models multiple coupon collections. Associated with surjections, ordered
set partitions, and generalized Stirling$_2$ numbers.
References: [46], [51].

Name: Pelican
Operator ($\Gamma$): $xy\,\partial_w + yw\,\partial_x + wx\,\partial_y$
Model: Models the pelican sacrifice model. Is solved in terms of Jacobian elliptic
functions sn, cn.
References: |36l Ii6l [128].

FIGURE 25. Some first-order trivariate operators, in variables w, x, y,
that are associated with solvable urn models and explicit normal forms,
together with the main characteristics of the models, after [46].



and

(103) $$R(\mathbf{x},z) = \int_0^z \rho\big(x_1+T_1(\mathbf{x},w),\ldots,x_m+T_m(\mathbf{x},w)\big)\,dw.$$

Proof. Combinatorially, we are now dealing with a collection $\mathcal{T}_1,\ldots,\mathcal{T}_m$ of trees of m
different types. The multivariate extension of the basic framework of gates, Proposition 19,
shows that the $\mathcal{T}_j$ must represent a collection of types of increasing trees satisfying the
equations (cf Section 6.2)

$$\mathcal{T}_j = \mathcal{Z}^{\square}\star\phi_j(x_1+\mathcal{T}_1,\ldots,x_m+\mathcal{T}_m),$$

for $j = 1,\ldots,m$, and $\mathcal{R}$ is a "planted" variety of trees corresponding to (cf Section 6.3)

$$\mathcal{R} = \mathcal{Z}^{\square}\star\rho(x_1+\mathcal{T}_1,\ldots,x_m+\mathcal{T}_m).$$

(Here, the variables $x_j$ serve to mark the types of leaves.) This gives rise to a set of
integral equations,

$$T_j(\mathbf{x},z) = \int_0^z \phi_j\big(x_1+T_1(\mathbf{x},w),\ldots,x_m+T_m(\mathbf{x},w)\big)\,dw,$$

which are clearly equivalent to the differential relations (102), as well as to the expression
of R in (103). □
In terms of partial differential equations, Proposition 24 concerns the solution $F(\mathbf{x},z)$
of

(104) $$\frac{\partial}{\partial z}F(\mathbf{x},z) = \sum_{j=1}^{m}\phi_j(x_1,\ldots,x_m)\frac{\partial}{\partial x_j}F(\mathbf{x},z) + \rho(x_1,\ldots,x_m)\,F(\mathbf{x},z).$$

What is noticeable is the fact that there is a reduction to a collection of ordinary differential
equations, of which the Ehrenfest urn of Note 23 provides an explicitly solvable case; the
equations are in general non-linear, though, and integrability is far from being granted.
This reduction is classically achieved by the well-known method of characteristics
[§1.15]. We then have: the method of characteristics, in the special case (104) at least,
admits a combinatorial model, which is that of multitype increasing trees.

In the particular case where $\rho = 0$ and the $\phi_j$ are homogeneous$^{31}$ polynomials of total
degree s, the solution $F(\mathbf{x},z)$ provides the enumeration of histories of a balanced urn
model with m colours and balance s. This generalizes the results of Note 24 relative to
two-colour models. Our equivalence with the combinatorics of increasing trees can then
be seen as a parallel to the reduction of urn models to multitype branching processes [2,
§9].

Explicit iteration formulae (normal forms) for first-order operators in more than
two variables are somewhat rare: see Figure 25 for some solvable cases associated
with balanced urn models, drawn from [46], and the papers by Dumont [36], [37]
for some additional examples related to classical combinatorics. Even less is
explicit in the case of higher order operators. In the quantum physics literature, the
paper of Agrawal and Mehta [T] shows what can be done with the exponential of
multivariate quadratic forms in $X_j$, $D_j$ and it would be of obvious interest to elicit
its combinatorial content. Similar comments apply to the works of Heffner and
Louisell [BS], Louisell [SJ pp. 203-207], Marburger [HI [M], and Wilcox [133], to
name a few.



10. Perspectives 

As argued throughout this paper, a large number of algebraic identities associ- 
ated with two operators satisfying the partial commutation relation AB — BA = 1 
have a clear combinatorial content, from which many natural and simple proofs 
result. Of course, many, if not most, of our formulae are not new — for instance, 
many are known to Lie theorists — but we feel that the gates-and-diagram model 
has a definite explanatory value and it can serve as a powerful organizing principle 
for families of polynomial identities stemming from the relation AB — BA = 1 (or, 
in our favorite notations, DX — XD = 1). 

There are clearly a great many related combinatorial works that we could not
discuss in this already long paper. Perhaps one of the most interesting connections
that we left unexplored is with works of Fomin [52] and Stanley [120] relative
to modular (graded) graphs and differential posets; these, in particular, provide
a natural setting for the study of Young tableaux and the Robinson-Schensted
correspondence. Under the Fomin-Stanley framework, typically, what is given is
a graded graph satisfying additional properties, where two operators U (for "Up")
and D (for "Down") enjoy the property $DU - UD = rI$, for some integer
$r \in \mathbb{Z}_{\ge 1}$. Enumerative properties that lie beyond the purely algebraic operator relations
studied here can be fruitfully developed within the framework of [52, 120]. (See also
the many follow-up papers, e.g., [55], [54], [121].) In this context, our work is to be
regarded as an elaboration of the particular case r = 1, when the underlying graph
is the set of nonnegative integers (cf our Section 7).

31 In the inhomogeneous case, the probabilistic connection with urn processes is lost and we are
plainly enumerating combinatorial tree families, or, what amounts to the same by Subsection 8.1,
certain families of weighted lattice paths.

The present paper is one of the many studies that signal the possibility of ap- 
proaching the analysis of certain combinatorial processes by means of linear opera- 
tors bound by suitable algebraic relations. An early instance is Paterson's "Cookie 
Monster" model, which is discussed by Greene and Knuth [63, Ch 3] and serves to 
analyse coalescence phenomena in hashing algorithms. We have briefly evoked the 
loosely related case of Polya urn models in Section 9. Another highly interesting
instance is the modelling of the exclusion process on an interval (PASEP, TASEP, 
...) by means of dedicated operators D,E satisfying the commutation relation 
DE — ED = D + E, as discovered by Derrida et al. [35]. There is currently an ex- 
tensive literature dedicated to combinatorial aspects of this process; see, typically, 
the works of Corteel et al. [2TJ HSj , Viennot [130] , and references therein to earlier 
works. 

There are finally a few areas that we left largely unexplored and which would 
deserve further consideration. One concerns the role of eigenfunction expansion, 
of which we offered a glimpse in Section [2] when briefly discussing the heat kernel 
through eigenfunctions of the operator e tD . Another is relative to further connec- 
tions with orthogonal expansions and orthogonal polynomials, otherwise known to 
be tightly coupled with the continued fractions of our Section 7.
Appendix A. Scherk's dissertation of 1823

In this appendix, we examine the "inaugural dissertation" [113] which Heinrich
Ferdinand [Henricus Ferdinandus] Scherk defended in 1823 at the University of
Berlin. The dissertation is in Latin, and, as this language may not be easily
understood by the younger generation, we offer here a brief account of its contents. In
modern terms, Scherk's thesis can be seen as dealing with the reduction to normal
form of an expression of the form $(yD)^n$, where various choices for the function
$y = y(x)$ are considered. The evocative title of the dissertation is, from its front
page:

DE
EVOLVENDA FUNCTIONE
$$\frac{y\,d.\;y\,d.\;y\,d\,\ldots\,y\,d\mathcal{X}}{dx^n}$$
DISQUISITIONES NONNULLAE ANALYTICAE.

That is, "Some analytic investigations relative to the expansion of the function
$(yD)^n\mathcal{X}$". Throughout this section, the calligraphic $\mathcal{X} = \mathcal{X}(x)$ is taken to represent
an arbitrary function, not to be confused$^{32}$ with the operator X.

The memoir is comprised of 10 sections (numbered 1 to 10) and the body of the
text consists of 36 pages. It is supplemented by a two page table of values of Stirling
numbers, followed by a two page Vita [113, pp. 36-37], where we learn that Scherk
was born on October 27, 1797 in the (now Polish) city of Poznań. It concludes with
a page containing four assertions (called "theses"), apparently of the author's own
design, amongst which the second one reads

Calculi, quos dicunt superiores, in Algebra elementari ponendi sunt.

A rough rendition is: "What is known as the higher calculus is to be placed within
the framework of elementary algebra". This excellently summarizes the philosophy
underlying Scherk's work (as well as ours!).

We are very much indebted to Dr Morteza Mohammad Noori [103], from the
University of Tehran, for drawing our attention to Scherk's thesis. An electronic
version can be found under the rich and highly useful GDZ site (Göttinger
Digitalisierungszentrum),

http://gdz.sub.uni-goettingen.de/,

from which a large portion of the nineteenth century mathematical literature in
German is available. The MacTutor reference site for mathematicians' biographies
has an extended bibliographic notice on Scherk, which can be supplemented
by the dedicated site hosted by the University of Halle. The URLs are:

http://www-history.mcs.st-and.ac.uk/

http://cantor1.mathematik.uni-halle.de/history/scherk/.

(There we learnt that Scherk's later discovery of the third non-trivial example of a
minimal surface brought him considerable fame. He appears to have been otherwise
interested in Bernoulli and secant numbers, as well as in the number of combinations
with bounded repetitions.) The Mathematical Genealogy Project indicates that
Scherk had 19728 descendants (via his famous student, Ernst Kummer), as of July
2010.

32 This is meant to preserve the correspondence with Scherk's original notations.
§1. Scherk first notes (p. 1) that the series previously considered mostly involve
an iterated use of arithmetic operations, especially, multiplication and division. In
this category, we find series such as

$$\sum f_n x^n, \qquad \sum f_n\,x(x-1)\cdots(x-n), \qquad \sum f_n\,\frac{1}{x(x+1)\cdots(x+n)},$$

which are of Taylor, Newton-interpolation, and factorial type, respectively; these
had all been investigated by his time. Scherk then notes that little is known if the
formation law of the general term of a series involves arbitrary differentiation and
integration operations. The most general problem, he notes, exceeds by far his own
strength. Accordingly, he focuses his attention on the special case

$$\frac{(y\,d)^n\mathcal{X}}{dx^n},$$

which, in our notations, is $(yD)^n\mathcal{X}$, where $y = y(x)$ and $\mathcal{X} = \mathcal{X}(x)$. He then
goes on to explain (p. 2) that his purpose will be to express the general term as
a function of the derivatives $\mathcal{X}, \mathcal{X}', \ldots$. In our terminology, this is equivalent to
seeking normal forms, where all the occurrences of the derivative D appear on the
right.

§2. The problem to be considered now is the case $y = x$. A small table (p. 2),

$$(xD)\mathcal{X} = xD\mathcal{X}$$
$$(xD)^2\mathcal{X} = xD\mathcal{X} + x^2D^2\mathcal{X}$$
$$(xD)^3\mathcal{X} = xD\mathcal{X} + 3x^2D^2\mathcal{X} + x^3D^3\mathcal{X},$$

suggests an interesting law. Indeed, the operator formula (with $D = \frac{d}{dx}$ and X
representing as usual multiplication by x; i.e., $Xf(x) := xf(x)$)

(105) $$(XD)^n = \sum_{j=1}^{n} a^n_j\,X^jD^j$$

serves to define a (yet unknown) array of numbers $\{a^n_j\}$. (Scherk uses a slightly
different notation for what we denote by $a^n_j$.)

By considering $(xD)^n\mathcal{X} = (xD)\,(xD)^{n-1}\mathcal{X}$, one easily obtains the recurrence
(p. 3)

(106) $$a^n_k = a^{n-1}_{k-1} + k\,a^{n-1}_k.$$

Scherk then derives (p. 4) the (by now classical) alternating sum formula

(107) $$a^n_k = \frac{(-1)^{k-1}}{\Pi(k-1)}\sum_{h=0}^{k-1}(-1)^h\,(k-1)_h\,(h+1)^{n-1},$$

his notations being $\Pi(k) = k!$ and $(x)_y = \binom{x}{y} = \frac{x!}{y!\,(x-y)!}$.

Next (bottom of p. 4 and top of page 5), Scherk proceeds to introduce two types
of numbers,$^{33}$ $C'^h_k$ and $'C^h_k$, as follows.

33 We continue using more modern notations and write $C'^h_k$, $'C^h_k$ for Scherk's corresponding symbols.
Scherk: $a^{h+k}_k$
Definition: defined for §2-3 as coefficients in the normal form of $(XD)^n$, cf Eq. (105)
Modern: $\left\{{h+k \atop k}\right\}$
Properties: equivalence with Stirling$_2$ numbers is granted via Eq. (107), assuming
as known the alternating sum expression of Stirling$_2$ numbers

Scherk: $C'^h_k$
Definition: defined, as in Eq. (108) below, as elementary symmetric function of
degree h in the integers 1, 2, ..., k; see [113, p. 4]
Modern: $\left[{k+1 \atop k+1-h}\right]$
Properties: equivalence via the (now) classical generating function of Stirling$_1$
numbers

Scherk: $'C^h_k$
Definition: defined, as in Eq. (109) below, as complete homogeneous symmetric
function of degree h in the integers 1, 2, ..., k; see [113, p. 5]
Modern: $\left\{{h+k \atop k}\right\}$
Properties: equivalence via the (now) classical generating function of Stirling$_2$
numbers

FIGURE 26. A correspondence table for §1-7: Scherk's notations, his
definitions, the corresponding modern notations, and corresponding
properties.

The quantity $C'^h_k$ is defined as the sum of (the products of) combinations without repetitions of $h$ of the integers $1, 2, \ldots, k$; in symbols,

$$C'^h_k = \sum_{1 \le j_1 < j_2 < \cdots < j_h \le k} j_1 j_2 \cdots j_h. \tag{108}$$

The quantity ${}'C^h_k$ is similarly defined as a sum of combinations with repetitions; in symbols,

$${}'C^h_k = \sum_{1 \le j_1 \le j_2 \le \cdots \le j_h \le k} j_1 j_2 \cdots j_h. \tag{109}$$

Remarks. The definitions of the number arrays ${}'C$, $C'$ are nowadays easily seen to be equivalent to the generating function expressions

$$\sum_h C'^h_n\, x^{n-h} = (x+1)\cdots(x+n), \qquad \sum_h {}'C^h_n\, x^h = \frac{1}{(1-x)(1-2x)\cdots(1-nx)},$$

where one can recognize variants of Stirling numbers [25].

Scherk then gives (p. 5) two recurrences

$$C'^h_k = C'^h_{k-1} + k\,C'^{h-1}_{k-1}, \qquad {}'C^h_k = {}'C^h_{k-1} + k\,{}'C^{h-1}_k, \tag{110}$$

and he concludes (p. 6) from a comparison of the recurrences (106) and (110) that

$$a^n_k = {}'C^{\,n-k}_k,$$

which provides the normal form of $(XD)^n$ as (his notations, p. 6):

$$\frac{(x\,d)^n X}{dx^n} = \sum_{k=1}^{n} {}'C^{\,n-k}_k\, \frac{x^k\, d^k X}{dx^k}. \tag{111}$$
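The two arrays can be computed by brute force from their definitions, $C'^h_k$ as the sum of products over $h$-subsets of $\{1, \ldots, k\}$ and ${}'C^h_k$ as the sum of products over $h$-multisets of $\{1, \ldots, k\}$, which gives a direct numerical check of the two recurrences and of the identification $a^n_k = {}'C^{\,n-k}_k$. This is a minimal sketch; the names `c_prime`, `prime_c` and `a` are ours.

```python
from itertools import combinations, combinations_with_replacement
from math import prod


def c_prime(h, k):
    """C'^h_k: sum of products over h-subsets of {1, ..., k} (no repetitions)."""
    return sum(prod(c) for c in combinations(range(1, k + 1), h))


def prime_c(h, k):
    """'C^h_k: sum of products over h-multisets of {1, ..., k} (repetitions allowed)."""
    return sum(prod(c) for c in combinations_with_replacement(range(1, k + 1), h))


def a(n, k):
    """a^n_k from a^n_k = a^{n-1}_{k-1} + k a^{n-1}_k, with a^1_1 = 1."""
    if k < 1 or k > n:
        return 0
    if n == 1:
        return 1
    return a(n - 1, k - 1) + k * a(n - 1, k)


# The two recurrences, checked on a small range of indices.
rec_ok = all(
    c_prime(h, k) == c_prime(h, k - 1) + k * c_prime(h - 1, k - 1)
    and prime_c(h, k) == prime_c(h, k - 1) + k * prime_c(h - 1, k)
    for h in range(1, 5)
    for k in range(1, 6)
)

# Identification of the normal-form coefficients with the 'C array.
ident_ok = all(a(n, k) == prime_c(n - k, k) for n in range(1, 8) for k in range(1, n + 1))
```

Both flags come out true on the ranges tested, which is of course no substitute for the inductive argument but makes the indexing conventions concrete.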

We state with modern notations:

Footnote. As was customary in his time, Scherk does not provide typographically marked statements, such as theorems or propositions, but rather plainly quotes his main results en passant.






Proposition A.1 (Scherk [113, p. 6]). The normal form of $(XD)^n$ is given by

$$(XD)^n = \sum_{k=1}^{n} {n \brace k} X^k D^k, \qquad n \ge 1.$$
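The identity $(XD)^n = \sum_{k=1}^{n} {n \brace k} X^k D^k$ can be tested by letting both sides act on a polynomial. On a monomial, $(XD)\,x^i = i\,x^i$ and $X^k D^k x^i = i(i-1)\cdots(i-k+1)\,x^i$, so everything reduces to arithmetic on coefficient lists. The sketch below assumes polynomials are stored as Python lists `p` with `p[i]` the coefficient of $x^i$; this representation and the function names are ours.

```python
def stirling2(n, k):
    """Stirling numbers of the second kind via the standard recurrence."""
    if n == k:
        return 1
    if k < 1 or k > n:
        return 0
    return stirling2(n - 1, k - 1) + k * stirling2(n - 1, k)


def falling(i, k):
    """Falling factorial i (i-1) ... (i-k+1)."""
    out = 1
    for t in range(k):
        out *= i - t
    return out


def apply_xd_power(p, n):
    """Apply (XD)^n to the polynomial p by iterating (XD) x^i = i x^i."""
    for _ in range(n):
        p = [i * c for i, c in enumerate(p)]
    return p


def apply_normal_form(p, n):
    """Apply sum_k S(n,k) X^k D^k to p, using X^k D^k x^i = falling(i, k) x^i."""
    return [
        sum(stirling2(n, k) * falling(i, k) for k in range(1, n + 1)) * c
        for i, c in enumerate(p)
    ]


p = [5, -3, 0, 2, 7]  # an arbitrary test polynomial 5 - 3x + 2x^3 + 7x^4
agree = all(apply_xd_power(p, n) == apply_normal_form(p, n) for n in range(1, 7))
```

Since both operators are diagonal on monomials, agreement on every $x^i$ amounts to the classical identity $i^n = \sum_k {n \brace k}\, i(i-1)\cdots(i-k+1)$.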



§3. Scherk continues (pp. 7-8) with various considerations related to setting $X = -x(1-x)^{k-1}$ followed by the substitution $x = 1$. His goal is to obtain a new proof of the normal form result (111). He also rederives in this framework the relation (107) directly from the definition of the $a^n_k$.

§4. We now come to another problem, namely, that of the reduction of

$$(e^x D)^n X.$$

The coefficients $b^n_k$ are determined by the normal form (p. 9):

$$(e^x D)^n X = e^{nx} \sum_{k=1}^{n} b^n_k\, D^k X. \tag{112}$$



Scherk then derives a recurrence to be compared to our Equation (110), from which there results (p. 10) that

$$b^n_k = C'^{\,n-k}_{n-1}.$$

Thus the $b^n_k$ are now recognizable as Stirling numbers of the first kind. We state:

Proposition A.2. The normal form of $(e^x D)^n$ is given by

$$(e^x D)^n = e^{nx} \sum_{k=1}^{n} {n \brack k} D^k.$$
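The normal form of $(e^x D)^n$ can be generated mechanically from the commutation rule $D\,e^{nx} = e^{nx}(D + n)$: left multiplication of $e^{nx}\sum_k b_k D^k$ by $e^x D$ yields $e^{(n+1)x}\sum_k (n\,b_k + b_{k-1}) D^k$. The sketch below iterates this rule and compares the outcome with the elementary symmetric functions of $1, \ldots, n-1$, which are known to give the unsigned Stirling numbers of the first kind. The data layout (a pair made of the exponent and a coefficient list) and the function names are our own choices.

```python
from itertools import combinations
from math import prod


def apply_exD(n, b):
    """Left-multiply e^{nx} * sum_k b[k] D^k by e^x D, using D e^{nx} = e^{nx} (D + n)."""
    new = [0] * (len(b) + 1)
    for k, bk in enumerate(b):
        new[k] += n * bk   # the derivative falls on e^{nx}
        new[k + 1] += bk   # the derivative passes through and raises D^k to D^{k+1}
    return n + 1, new


def normal_form(n):
    """Coefficient list b with (e^x D)^n = e^{nx} * sum_k b[k] D^k."""
    state = (0, [1])  # identity operator: e^{0x} D^0
    for _ in range(n):
        state = apply_exD(*state)
    return state[1]


def elementary(h, k):
    """Elementary symmetric function of degree h in the integers 1, ..., k."""
    return sum(prod(c) for c in combinations(range(1, k + 1), h))


match = all(
    normal_form(n)[k] == elementary(n - k, n - 1)
    for n in range(1, 8)
    for k in range(1, n + 1)
)
```

For instance `normal_form(3)` returns `[0, 2, 3, 1]`, that is, $(e^x D)^3 = e^{3x}(2D + 3D^2 + D^3)$.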



The section concludes with representations of Stirling1 numbers in terms of (sort of) generalized harmonic numbers.

§5. This section digresses from the main course of the dissertation (p. 14) and is admittedly only loosely related to the other sections. The problem consists in finding the laws of the derivatives of a composite function. The formula, commonly attributed to Faà di Bruno (born in 1825(!), publication dated 1850), figures explicitly (p. 14); see [25] for a statement. Scherk says:

    expressionem quamdam invenimus, qualem frustra in multis libris jam saepius quaesiveramus, quam itaque paucis verbis hic tangere liceat.

    we have discovered a certain formula, which we have very often tried in vain to find in books, so that we feel allowed to devote a few words to its description.

We refer to the scholarly study of Warren Johnson [69], regarding the history of this formula, which was once considered a basic component of combinatorial analysis. Nowadays, via generating functions, the computation of the $n$th derivative of a composite function $f(g(z))$ amounts to nothing but the determination of a coefficient,

$$[z^n]\, \sum_{j \ge 0} f_j \Big( \sum_{k \ge 1} g_k z^k \Big)^{j},$$

i.e., it is a simple avatar of the multinomial formula.
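The coefficient extraction can be carried out with truncated power series over exact rationals. The sketch below assumes that $f$ and $g$ are given by lists of Taylor coefficients, with $g_0 = 0$, and returns $[z^n] f(g(z))$; multiplying by $n!$ gives the $n$th derivative at the expansion point. The function name and the choice of test case ($f(w) = e^w$, $g(z) = e^z - 1$, whose composition is the exponential generating function of the Bell numbers) are ours.

```python
from fractions import Fraction
from math import factorial


def composite_coefficient(f, g, n):
    """Coefficient of z^n in sum_j f[j] * g(z)^j, assuming g[0] == 0.

    f must have at least n + 1 entries; only powers j <= n can contribute,
    since g(z)^j starts at z^j.
    """
    assert g[0] == 0

    def mul(a, b):
        c = [Fraction(0)] * (n + 1)
        for i, ai in enumerate(a[: n + 1]):
            for j, bj in enumerate(b[: n + 1 - i]):
                c[i + j] += ai * bj
        return c

    total = Fraction(0)
    power = [Fraction(1)] + [Fraction(0)] * n  # g(z)^0
    for j in range(n + 1):
        total += f[j] * power[n]
        power = mul(power, g)
    return total


order = 8
f = [Fraction(1, factorial(j)) for j in range(order)]                  # e^w
g = [Fraction(0)] + [Fraction(1, factorial(k)) for k in range(1, order)]  # e^z - 1
derivatives = [factorial(n) * composite_coefficient(f, g, n) for n in range(6)]
```

With this test case the list `derivatives` consists of the first Bell numbers, 1, 1, 2, 5, 15, 52.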

Footnote. In fact, Scherk also denotes the new coefficients by the letter $a$; we adopt a different convention for clarity.
§6. Scherk now considers (p. 15) the inverse problem: how to express the derivatives $D^n X$ in terms of the quantities $(xD)^n X$ and $(e^x D)^n X$. His main formula is (p. 18)

$$x^n D^n X = \sum_{h=0}^{n-1} (-1)^h\, C'^h_{n-1}\, (xD)^{n-h} X,$$

and a parallel one serves to express $e^{nx} D^n$ in terms of the powers $(e^x D)^j$, this time by means of the Stirling2 numbers (his ${}'C$). In retrospect, these two formulae are manifestations of the fact that the matrices of Stirling1 and Stirling2 numbers are inverse of each other. Scherk can then conclude that the coefficients $C'$ serve to make explicit the law of the successive derivatives of $1/(x \log x)$ (p. 18) and $\log\log x$ (p. 19, top):



$$\frac{d^n}{dx^n} \log\log x = \frac{(-1)^{n-1}}{x^n} \sum_{k=0}^{n-1} {n \brack k+1} \frac{k!}{(\log x)^{k+1}}.$$
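Both statements of this section lend themselves to a mechanical check. On a monomial $x^m$, the operator $x^n D^n$ acts as multiplication by $m(m-1)\cdots(m-n+1)$ and $(xD)^j$ as multiplication by $m^j$, so the main formula becomes a polynomial identity in $m$. For the derivatives of $\log\log x$, one can iterate $D\big(x^{-n}(\log x)^{-j}\big) = -n\,x^{-n-1}(\log x)^{-j} - j\,x^{-n-1}(\log x)^{-j-1}$ and compare with the closed form $\frac{(-1)^{n-1}}{x^n}\sum_{k=0}^{n-1} {n \brack k+1}\, k!\,(\log x)^{-k-1}$. The sketch below does both; the function names are ours.

```python
from itertools import combinations
from math import factorial, prod


def elementary(h, k):
    """Elementary symmetric function of degree h in 1, ..., k (Scherk's C'^h_k)."""
    return sum(prod(c) for c in combinations(range(1, k + 1), h))


def stirling1_unsigned(n, k):
    """Unsigned Stirling number of the first kind, as e_{n-k}(1, ..., n-1)."""
    return elementary(n - k, n - 1) if 1 <= k <= n else 0


def falling(m, n):
    out = 1
    for t in range(n):
        out *= m - t
    return out


def main_formula_rhs(n, m):
    """Eigenvalue of sum_h (-1)^h C'^h_{n-1} (xD)^{n-h} on the monomial x^m."""
    return sum((-1) ** h * elementary(h, n - 1) * m ** (n - h) for h in range(n))


def loglog_derivative(n):
    """Dictionary c with D^n log log x = x^{-n} * sum_j c[j] * (log x)^{-j}."""
    c = {1: 1}  # D log log x = x^{-1} (log x)^{-1}
    for order in range(1, n):
        new = {}
        for j, cj in c.items():
            new[j] = new.get(j, 0) - order * cj      # derivative of x^{-order}
            new[j + 1] = new.get(j + 1, 0) - j * cj  # derivative of (log x)^{-j}
        c = new
    return c


inverse_ok = all(
    falling(m, n) == main_formula_rhs(n, m) for n in range(1, 7) for m in range(0, 9)
)
loglog_ok = all(
    loglog_derivative(n)
    == {j: (-1) ** (n - 1) * stirling1_unsigned(n, j) * factorial(j - 1) for j in range(1, n + 1)}
    for n in range(1, 8)
)
```

For example, `loglog_derivative(3)` gives `{1: 2, 2: 3, 3: 2}`, i.e. $D^3 \log\log x = x^{-3}\big(2/\log x + 3/(\log x)^2 + 2/(\log x)^3\big)$.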



§7. This section gives two further consequences (p. 20) of previous developments, including a formula which expresses the expansion of a function $g(z)$ in terms of $\log z$, when $z$ is near 1.

§8. The purpose is now to come back to the general problem (p. 20) of the reduction of $(yD)^n$. Naturally, large symbolic expressions are available, for small $n$, involving $y, y', y'', \ldots$ (The notation used in the memoir is $\underset{i}{U} = y^{(i)}$.) A typical formula (p. 20) reads:

$$(yD)^4 X = (y y'^3 + 4 y^2 y' y'' + y^3 y''')\,X' + (7 y^2 y'^2 + 4 y^3 y'')\,X'' + 6 y^3 y'\,X''' + y^4 X''''.$$

A discussion of the combinatorics of the coefficients occupies the next few pages (pp. 21-27). Naturally, the previously considered cases $y = x$ and $y = e^x$ provide various checks. The corresponding coefficients are denoted by $v^n_k$ (written as usual with stacked indices in the memoir). Scherk deduces various qualitative results on the shape of a general expansion: he obtains exact values for the first few coefficients as well as some infinite classes of particular coefficients. He then concludes regarding a possible discovery of the general law:

    Solutio itaque ideo tam difficilis facta est, quod disquisitionem coefficientium numericorum ab inventione singulorum terminorum ipsorum segregare non potuerimus.

    Then, the process has become so untractable that we could not succeed with an investigation of all the numerical coefficients, based on the sole discovery of some individual terms.

Remarks. The symbolic problem is indeed difficult. From the solution described earlier, in our Section 6, we now know the following.

(i) The coefficient of $X^{(r)}$ in $(yD)^n X$ is obtained from the set of unordered $r$-forests of (rooted, non-plane) increasing trees having a total size of $n$, by assigning to a vertex of outdegree $j$ a weight equal to $y^{(j)}$; see Figure 27. (Trees are taken "unordered" (i.e., non-plane) owing to the fact that $y(x)$ is here an exponential generating function.)
(ii) Symbolically, one should proceed as follows, for an expansion of $(yD)^n X$:
  - Start from $y(x) = \sum_j y_j \frac{x^j}{j!}$, so that $y_j$ plays the role of $y^{(j)}$.
  - Compute $1/y(x)$, which involves a series of multinomial expansions.
  - Integrate the previous expansion, which only involves a substitution $x^{n-1} \mapsto x^n/n$ for each $n$: we get in this way $\Phi := \int y^{-1}$.
  - Take the compositional inverse of the preceding expansion $\Phi$; by the Lagrange inversion theorem, this only involves a further sequence of multinomial expansions. (We determine here the generating function $T(x)$ of increasing trees.)
  - Raise $T(x)$ to the $r$th power (for the coefficient of $X^{(r)}$), extract the coefficient of $x^n$ in $\frac{1}{r!} T^r$, and finally multiply this coefficient by $n!$ since we are dealing with exponential generating functions.

FIGURE 27. Left: An illustration of the correspondence between Scherk's expansion of $(yD)^n X$ and forests of trees (case $n = 4$). Right: the decoration of a particular tree by the derivatives of $y$.
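The steps of item (ii) can be sketched in Python with truncated power series over exact rationals. To keep the code short, this sketch does not treat the $y_j$ as symbols (as one would in a computer algebra system) but assigns them numerical values, with $y_0 \neq 0$; this simplification, the truncation order `N`, and all function names are our own assumptions. The inversion step uses the Lagrange formula in the form $[x^n]\,T^r = \frac{r}{n}\,[w^{n-r}]\,\big(w/\Phi(w)\big)^n$.

```python
from fractions import Fraction
from math import factorial

N = 8  # series are truncated to the coefficients of x^0 .. x^(N-1); use n < N


def mul(a, b):
    """Product of two truncated series."""
    c = [Fraction(0)] * N
    for i, ai in enumerate(a):
        for j, bj in enumerate(b[: N - i]):
            c[i + j] += ai * bj
    return c


def reciprocal(a):
    """Truncated series 1/a, assuming a[0] != 0."""
    c = [Fraction(0)] * N
    c[0] = 1 / Fraction(a[0])
    for n in range(1, N):
        c[n] = -c[0] * sum(a[k] * c[n - k] for k in range(1, n + 1))
    return c


def scherk_coefficient(y_values, n, r):
    """Value of the coefficient of X^(r) in (yD)^n X when y^(j) = y_values[j]."""
    # Step 1: y(x) = sum_j y_j x^j / j!
    y = [Fraction(v) / factorial(j) for j, v in enumerate(y_values)][:N]
    y += [Fraction(0)] * (N - len(y))
    # Step 2: 1 / y(x)
    inv_y = reciprocal(y)
    # Step 3: Phi = integral of 1/y, i.e. x^(m-1) -> x^m / m
    phi = [Fraction(0)] + [inv_y[m - 1] / m for m in range(1, N)]
    # Steps 4 and 5: Lagrange inversion applied to T = Phi^{-1}
    phi_over_w = phi[1:] + [Fraction(0)]  # Phi(w)/w (top coefficient truncated)
    base = reciprocal(phi_over_w)         # w / Phi(w)
    power = [Fraction(1)] + [Fraction(0)] * (N - 1)
    for _ in range(n):
        power = mul(power, base)
    t_power_coeff = Fraction(r, n) * power[n - r]  # [x^n] T^r
    return t_power_coeff * factorial(n) / factorial(r)


# With y = 1, y' = 2, y'' = 3, the coefficient of X'' in (yD)^4 X,
# namely 7 y^2 y'^2 + 4 y^3 y'', evaluates to 7*4 + 4*3 = 40.
value = scherk_coefficient([1, 2, 3, 4, 5, 6], 4, 2)
```

Taking all $y_j = 1$ corresponds to $y = e^x$ with the exponential factor normalized away, in which case the function returns unsigned Stirling numbers of the first kind.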

Barely half a dozen instructions suffice to implement the general procedure in a symbolic manipulation system such as Maple. We can in this way verify, in a fraction of a second, the correctness of the expansion relative to $(yD)^5 X$, in the form given by Scherk (p. 20). For the benefit of the curious reader, here is the outcome of our program in the case of $(yD)^6 X$ (with $X_r$ standing for $X^{(r)}$ and $y_j$ for $y^{(j)}$):

$$\begin{aligned}
& X_1\,\big(32\,y_0^3 y_3 y_1^2 + 34\,y_2^2 y_0^3 y_1 + y_0 y_1^5 + 26\,y_0^2 y_2 y_1^3 + 11\,y_1 y_4 y_0^4 + 15\,y_3 y_0^4 y_2 + y_5 y_0^5\big) \\
&+ X_2\,\big(34\,y_0^4 y_2^2 + 31\,y_0^2 y_1^4 + 146\,y_0^3 y_2 y_1^2 + 57\,y_0^4 y_1 y_3 + 6\,y_0^5 y_4\big) \\
&+ X_3\,\big(90\,y_0^3 y_1^3 + 120\,y_0^4 y_1 y_2 + 15\,y_0^5 y_3\big) \\
&+ X_4\,\big(65\,y_0^4 y_1^2 + 20\,y_0^5 y_2\big) + 15\,X_5\, y_0^5 y_1 + X_6\, y_0^6.
\end{aligned}$$
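An independent way to obtain such expansions is to iterate the operator $yD$ directly, using the Leibniz rule with $D\,y_j = y_{j+1}$ and $D\,X_r = X_{r+1}$. The following Python sketch is not the Maple program mentioned above, only a brute-force cross-check. It stores a term $c \cdot \prod_j y_j^{e_j} \cdot X_r$ under the dictionary key `(r, e)`, a representation chosen by us.

```python
from collections import defaultdict


def apply_yD(poly):
    """Apply yD to a sum of terms c * prod_j y_j^e[j] * X_r stored as {(r, e): c}."""
    out = defaultdict(int)
    for (r, e), c in poly.items():
        # Leibniz rule on the y-part: one factor y_j becomes y_{j+1}.
        for j, ej in enumerate(e):
            if ej:
                d = list(e)
                d[j] -= 1
                d[j + 1] += 1
                d[0] += 1  # left multiplication by y = y_0
                out[(r, tuple(d))] += c * ej
        # Derivative falling on X^(r), again followed by multiplication by y_0.
        d = list(e)
        d[0] += 1
        out[(r + 1, tuple(d))] += c
    return dict(out)


def expand(n):
    """Normal form of (yD)^n X as a dictionary {(r, exponents of y_0..y_n): coefficient}."""
    poly = {(0, (0,) * (n + 1)): 1}  # X itself: r = 0 and no y factor
    for _ in range(n):
        poly = apply_yD(poly)
    return poly


p6 = expand(6)
# Coefficient of y_0^3 y_1^2 y_2 X_2 in (yD)^6 X.
c_146 = p6[(2, (3, 2, 1, 0, 0, 0, 0))]
```

Running `expand(4)` returns the seven terms of the formula for $(yD)^4 X$, and `expand(6)` the nineteen terms of the expansion of $(yD)^6 X$.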

§9. This section, starting on p. 27, contains a further combinatorial investigation of the general case, in the light of the special cases considered earlier in §2-4. Near the end, the author offers a brief discussion (pp. 29-30) of the particular reduction of $(x^p D)^n$. What he obtains is, in present-day notations (we write $c_{n,k}$ for the coefficients, which Scherk once more denotes by the letter $a$):

Proposition A.3 (Scherk [113, pp. 30-34]). The normal form of $(X^p D)^n$ is given by

$$(X^p D)^n = X^{n(p-1)} \sum_{k=1}^{n} c_{n,k}\, X^k D^k,$$

where the coefficients $c_{n,k}$ satisfy

$$c_{n,k} = \sum_{1 \le j_1 \le j_2 \le \cdots \le j_{n-k} \le k} [j_1 p] \cdot [(j_2+1)p - 1] \cdot [(j_3+2)p - 2] \cdots [(j_{n-k} + n-k-1)p - (n-k-1)].$$
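The sum over weakly increasing indices can be compared with the recurrence that follows from a single commutation: since $(X^p D)\,X^{n(p-1)+k} D^k = X^{(n+1)(p-1)}\big[(n(p-1)+k)\,X^k D^k + X^{k+1} D^{k+1}\big]$, one has $c_{n+1,k} = (n(p-1)+k)\,c_{n,k} + c_{n,k-1}$ with $c_{1,1} = 1$. The sketch below evaluates both for numerical values of $p$; the function names are ours, and the recurrence is our own derivation rather than a statement taken from Scherk.

```python
from itertools import combinations_with_replacement
from math import prod


def c_formula(n, k, p):
    """c_{n,k} from the sum over 1 <= j_1 <= ... <= j_{n-k} <= k."""
    m = n - k
    return sum(
        prod((j + i) * p - i for i, j in enumerate(js))
        for js in combinations_with_replacement(range(1, k + 1), m)
    )


def c_recurrence(n, k, p):
    """c_{n,k} from c_{n,k} = ((n-1)(p-1) + k) c_{n-1,k} + c_{n-1,k-1}, c_{1,1} = 1."""
    if k < 1 or k > n:
        return 0
    if n == 1:
        return 1
    return ((n - 1) * (p - 1) + k) * c_recurrence(n - 1, k, p) + c_recurrence(n - 1, k - 1, p)


consistent = all(
    c_formula(n, k, p) == c_recurrence(n, k, p)
    for p in range(0, 5)
    for n in range(1, 7)
    for k in range(1, n + 1)
)
```

For $p = 1$ both functions return Stirling numbers of the second kind, in agreement with Proposition A.1; for instance `c_formula(4, 2, 1)` equals 7.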

§10. This section (pp. 31-36) contains formulae that the author obtained, not 
being at the time cognizant of similar works by the Bernoullis and Laplace. They 
are relative to Bernoulli numbers, hence they will not be discussed further here. 